I: OPEN-CLOSED MINDEDNESS AND TEACHER EDUCATION--THE PROBLEM



The research reported on the following pages is related to two types 
of endeavor: The first is the training of prospective teachers of the
social studies; the second is the definition of populations in studies
of college students in teacher education. The common thread making these 
two areas compatible in one research study is a concern with dogmatism— 
the open-closedness of belief systems— as it relates to teacher education. 

Social studies educators have long proclaimed the central role of 
instruction for critical or reflective thinking in the public schools. 
Nevertheless, training programs for social studies teachers do not include 
courses that would give the prospective teacher a conceptual framework 
from which to teach his students to think in a more systematic manner 
(see, e.g., Ballinger, 1963). Logic courses are, for example, not typi- 
cally required in teacher preparation programs, and there are few other 
courses at the university that focus on thought process rather than con- 
tent. Those that do focus on process, such as courses in educational 
research, are usually graduate courses. In any event, they frequently 
are of little relevance to those problems about which teachers want their 
students to reflect. 

The social studies curriculum and methods course universally required 
of the prospective social studies teachers provides one place where instruc- 
tion might focus on analytic frameworks appropriate for the social studies 
curriculum. In order to use the course for that purpose, however, the 
instructor would have to abandon some traditional shibboleths of teacher
education— for example, that it is important that methods courses teach 
the prospective teacher to plan lessons and units. Moreover, if the pro- 
spective teacher is to properly integrate an analytic scheme into his own 
conceptual frame and if he is to experience models of teaching appropriate 
to the involvement of students in thinking, the traditional college lec- 
ture approach to teaching would probably have to be largely abandoned in 
favor of a discussion technique (see, e.g., McKeachie, 1963, pp. 1126-27). 

The research in this report stems from an effort to depart from the 
traditional pattern of social studies methods courses in order to do two 
things: (1) concentrate on forcing the students to examine the assumptions
underlying the curricular decisions they would be making as social studies
teachers (such as those about the nature of democracy), and (2) help
prospective teachers to explore the reflective competencies that might be
helpful in teaching their students to analyze public issues. Class sessions 
were largely discussions of readings with Socratic-type teacher-student
interactions, with few lectures. 

Not too surprisingly, perhaps, the students' reactions to the course 
were mixed. Reactions did not, however, seem to be evenly spread over a
continuum of favorableness to non-favorableness, but tended to fall into
a dichotomy. Some students were quite disturbed by the course. It was
obvious to them that the course was not giving them "practical" prepara- 
tion for their first confrontation with the students whom they would face 
in their practice teaching within a quarter or two. Moreover, the
discussions were frustrating because ready answers were not provided to be
written down in neatly organized notebooks. As one student wrote on an
evaluation form, "The class is unfair because we are tested on how well
we think instead of what we know." Other students reacted positively to 
the course. They remarked that it was one of their first opportunities 
to think in a college course, instead of simply memorizing lecture notes 
and underlined sentences in a textbook and then regurgitating them on 
tests. These same students frequently commented on their previous lack 
of awareness of the complexity of curricular decision-making in the social 
studies and their rejection of the traditional history-oriented,
read-and-recite pattern of social studies instruction. The tendency for students
to find the course interesting or to be disturbed by it was reflected in 
their evaluations of the instructor obtained each quarter. 

McKeachie (1963) has noted that instructors who depart from the role 
expected by college students are likely to be rated lower than more
conventional instructors. He also suggests that while the relationship
between student satisfaction and learning tends to be low, instructors
usually would like their students to have "warm feelings" about their
classroom experiences. There even is some evidence that students with such
feelings may be more likely to be interested in learning more in the area. 
Other work by McKeachie (1961), as well as by Oliver and Shaver (1962,
1966; Shaver and Oliver, 1968), suggests that the differing reactions of
students to a course may be related to differences in personality charac- 
teristics. 

In reflecting about the social studies course to which the students 
were reacting so diversely, it seemed clear that it did make certain demands. 
Students were required to shift their conceptions of the proper instructor 
role. Moreover, the instructor did not provide a clear model of authority 
as the lecturer so often does. The analysis of assumptions underlying 
curricular decisions called for an openness in belief systems, and the
concepts of reflective thinking that were taught emphasized a tentative
approach to beliefs and the willingness to tolerate conflicting values (see
Oliver & Shaver, 1966, Chapter 7).

Rokeach's work on dogmatism (1960) has provided the Dogmatism Scale, 
"designed to measure individual differences in the extent to which belief 
systems are open or closed" (Rokeach, 1960, p. 19). The Dogmatism Scale 
grew out of earlier work on authoritarianism that had produced
the F-Scale (Adorno et al., 1950), frequently criticized as a measure of
authoritarianism of the political right rather than a general measure of
authoritarianism (e.g., Rokeach, 1960; Christie & Jahoda, 1954). The
variables measured by both scales seemed likely to be related to the reactions
of students to the revamped social studies curriculum and methods course. 
Rokeach also used the Gough-Sanford Rigidity Scale in attempting to dis- 
criminate tenacity of individual beliefs from the tenacity of belief sys- 
tems. That scale also held promise for giving a more complete view of 
tendencies toward lack of flexibility and openness that might relate to 
reactions to a mode of instruction. The Gough-Sanford Scale seemed parti- 
cularly appropriate because a common complaint of those who disliked the 
course was a lack of specificity in assignments and in information to be
recorded in notes. There are in the literature no reported studies dealing 
directly with the relationship of dogmatism, authoritarianism, and cogni- 
tive rigidity to reactions to a new course emphasizing analysis. This lack 
of information is the basic problem with which the research was initially 
concerned. 

As already noted, however, the project had a multiple thrust. Working 
with established groups (such as students signed up for or assigned to
class periods) rather than random selection and assignment is the common
modus operandi of educational researchers. This was, as already indicated, 
the case with the research to be presented in this report. Generally, 
researchers make the assumption that their "available" samples do not devi- 
ate significantly from population parameters. Unfortunately, this assump- 
tion is rarely checked. 

Evidence, to be noted in the review of literature, indicated that 
personality test scores for the particular sample of students with which
the proposed study was to be carried out would likely be biased (in the 
sense that repeated sampling from the group would not produce distributions 
clustered about the population mean). Previous studies had indicated that 
authoritarianism and dogmatism are related to religious affiliation. The 
central tendency of our sample, coming from a state university in Utah,
could be expected to be biased upward on the scales because
fundamentalism, such as that of the Latter-Day Saints (Mormon) Church, has generally
been positively related to scores on the F- and D-Scales. However, no 
findings have been reported on the relationship between the scales and reli- 
gious fundamentalism with students in teacher education. Yet this could be 
an important factor as researchers draw samples of students from different 
sub-cultures. Moreover, reported evidence was inconclusive on the relation- 
ship of the F- and D-Scales to factors such as the level of teaching (ele- 
mentary or secondary) anticipated and the subject area being prepared for. 

The lack of information was a problem for this research. Data about the 
fluctuation of F-, D-, and R-Scale scores as other sample characteristics 
varied and on the relationship of these scales to other information about 
teachers seemed likely to provide information of use to other researchers, 
who might want to judge the impact of bias on their samples. 

One other auxiliary concern became the subject of the project's inves- 
tigations. Common stereotypes include the notion that Germans are parti- 
cularly authoritarian. However, evidence as to the relative authoritarianism 
of comparable groups of Germans and Americans, such as prospective school 
teachers, is lacking. When an opportunity to administer the Dogmatism, F-,
and Gough-Sanford Rigidity Scales to German teacher education students
became available through the services of a colleague born and raised in
Germany, it was seized upon as a chance to obtain comparative data.

The study, then, was aimed at three related problems: The lack of 
information about the relationships of dogmatism, authoritarianism, and 
cognitive rigidity to reactions to an "innovative" course; the lack of 
data to use in establishing whether a particular sample was biased on
measures of dogmatism, authoritarianism, and cognitive rigidity; and the
lack of comparative data on dogmatism, authoritarianism, and cognitive 
rigidity of American and German students in teacher education. 

II: AUTHORITARIANISM AS A VARIABLE IN EDUCATIONAL RESEARCH



The history of the F-Scale is well known among social psychologists. 
First reported in the monumental book, The Authoritarian Personality
(Adorno, Frenkel-Brunswik, Levinson & Sanford, 1950), the F-Scale, intended
to measure "implicit antidemocratic trends," has come to be called the 
Authoritarianism Scale. Although the impetus for the study by Adorno and 
his colleagues was the horrifying manifestations of prejudice in Germany 
prior to and during World War II, the groundwork had been laid in studies 
by people such as Stagner (1936; 1954), Maslow (1943), Edwards (1941, 1944), 
and Fromm (1941). 

Probably no other research report has been so prolific a source of 
further research progeny. In a review of research stemming from The
Authoritarian Personality up to 1955, Titus and Hollander (1957) cited 64 studies, while
Christie and Cook (1958), reviewing research through 1955-56, cited 230
reports. Few works have had a greater contemporary impact on professional
and lay people (Christie & Cook, 1958, p. 171), probably because of the 
social climate of the times (Brown, 1965, p. 478). There is no reason to 
dispute the claim by Titus and Hollander (1957, p. 47) that no other per- 
sonality variable has been as widely used in research. Titus and Hollander 
(1957) reviewed studies relating the F-Scale to prejudice, political atti- 
tudes, family ideology, teacher attitudes, attitude change, social percep- 
tion, rigidity, neuroticism, minority group membership, leadership, group 
behavior, and subcultural differences. In their more extensive review, 
Christie & Cook (1958) discussed the F-Scale under headings such as poli- 
tical attitudes, child rearing, inter-personal behavior, prejudice, psycho- 
pathology, and minority group membership. They also included a section in 
which they reported studies relating F-Scale scores to miscellaneous vari- 
ables ranging from agreement with the position of the Catholic Church to 
ego strength and self concept. 



Studies Since 1956*

Since the Titus-Hollander and Christie-Cook reviews, research with the
F-Scale has continued at a rapid rate. Although the original impetus for 
the authoritarian personality studies was a concern for fascism and its 
rather extreme political-social ramifications, it is not surprising that
the F-Scale has been applied to the study of persons with less extreme 
political positions. For example, a study by Harned (1961) indicated that 
high authoritarians tend to be more oriented toward formal organization 
and less ideological in their relations to political parties than low 
authoritarians. Zippel and Norman (1967) found that supporters of Johnson
and Goldwater in their sample did not have different mean F-Scale scores,
although those who switched parties did have higher mean scores. One
study found supporters of Stevenson and Humphrey to have lower F scores
than supporters of Faubus and Johnson (Wrightson, Radloff, Harton &
Mecheriskoff, 1961). Remmers and Steinberg (1954) found a slight correlation
between F-Scale scores and political party (Republicans having higher
scores), while, among Egyptians, negative correlations have been found
between authoritarianism and liberalism (Melikan, 1959).

*The review that follows will concentrate on studies using the F-Scale
since the Christie-Cook and Titus-Hollander reviews, although prior
studies which are particularly relevant will occasionally be cited. Our review
is more extensive than would be strictly necessary as background for the
study. The intent has been to provide an up-to-date compendium of F-Scale
(and Dogmatism Scale) studies.

Notions about the relationships between conformity and how people 
relate to power have also prompted further studies. Conformity has been 
found to be slightly, although significantly, related to authoritarianism
(Vaughn & White, 1966), unrelated (Hardy, 1957; Endler, 1961), and depen- 
dent upon the reasonableness of the position advocated as well as the 
prestige of the source (Johnson & Steiner, 1967). Wells, Weinhert, and 
Rubel (1956) reported that, in judging who was at fault in an auto
accident scene, high F scorers (on Gough's 30-item form) were more
susceptible to pressure by confederates. Weiner and McGinnes (1961), using students
in an introductory psychology course, found that when asked to judge whether
a face projected on a screen was smiling or not, high and low F scorers 
did not differ significantly in conformity to confederate pressures. This 
finding corresponds with that by Baron and Arenson (1967) that high F scor- 
ers did not differ from low F scorers in susceptibility to social influence 
or in tending to gamble in a gambling game. As is so often the case in 
research with the F-Scale, differing subjects and different tasks make 
interpretation of divergent findings difficult. 

Smith (1967) investigated authoritarianism as it related to power 
behavior in triads. Wilkins and DeCharms (1962) found high authoritarians 
to rely more on external power cues in describing other persons, and to 
distinguish more sharply between persons on the basis of power cues such 
as socio-economic status. Preference for greater departmental control of 
college instruction was significantly related to F-Scale scores in a sample 
of college students (Bendig & Hountras, 1959). F-Scale scores have also 
shown a positive correlation (r = .56) to acceptance of military ideology 
as measured by a Military Ideology Scale (French & Ernest, 1956). 

Along the same lines, subjects who were obedient when told to give a 
high voltage shock to a fellow subject had a higher mean F-Scale score 
than those who defied the command (Elms & Milgram, 1966). Berkowitz and
Lundy (1957) found that high F-Scale scorers tended to be influenced by 
authority figures when asked to state opinions on matters such as movies 
and tariffs. Harvey and Beverly (1961) reported that in an investigation 
of the effect of role playing on concept change, there was a greater effect 
on high authoritarians, with the status of the source also having a greater 
effect with the high F group. 

Weitman (1962) found pro-authoritarians (those concerned with
authority and obedience) to have higher mean F-Scale scores than
anti-authoritarians and non-authoritarians. However, in a later study (Weitman, 1964),
conformists, as determined by four tests including the F-Scale, did not 
differ significantly from rebels and independents in their responses to 
an incomplete sentences scale. Rudin (1961) found a nonsignificant rela- 
tionship between scores on the F-Scale and scores on his Authority Scale. 
LaGaipa (1957) found that cartoons dealing with aggression toward authority 
were seen as less humorous by high F scorers under low ego involvement 
conditions, but more humorous under high ego involvement conditions, than 
was the case with the low F scorers. Lipetz and Ossorio (1967) reported that 
low authoritarians were more hostile than were high authoritarians toward
conditions in which they interacted with a high status target. 

In an investigation of aggression toward out-groups (Epstein, 1966), 
subjects with high F-Scale scores were more aggressive than those with 
low F scores. Although a negro model generally elicited greater aggression
(degree of electric shock administered was the criterion), the effect was
greater among low F subjects. The lower aggression evidenced by high
authoritarians may be a function of a stronger defense mechanism. This conjecture
is supported by the findings of Katz and Benjamin (1960) that in biracial
work groups, high authoritarians rated negroes higher than did low
authoritarians.

The review by Titus and Hollander (1957) noted that no clear cut 
relationship between anxiety and authoritarianism had been established.
In an attempt to clarify the situation, Davids and Eriksen (1957)
administered the F-Scale and the Manifest Anxiety Scale to a group of naval
enlisted men. No significant relationship was found. Noting that their
Navy men had a significantly higher mean F-Scale score than a group of 
college students, Davids & Eriksen concluded that the varying relation- 
ships found between the F-Scale and indices of neuroticism are likely due 
to the effects of differing socio-cultural settings. 

Research since 1956 has confirmed the results of studies reviewed 
by Christie and Cook (1958) in regard to race and authoritarianism. Negroes 
seem to have higher mean F-Scale scores than whites according to studies 
with a college sample (Kelman & Barclay, 1963), graduate students in edu- 
cation (carried out before the Christie and Cook review, but not reported 
in it) (Remmers & Steinburg, 1954), and a large (N = 2079) nationwide sample 
of high school and college students (Greenberg & Fare, 1959). The
correlation between F scores and scores on the E- (ethnocentrism) Scale has
been found to be lower for a group of negroes than for whites, suggesting
sympathy for other minority groups (Taylor, 1962). 

Other studies have related the F-Scale to an incredible variety of 
variables. Authoritarianism scores have been found to be correlated with 
parental attitudes toward the authoritarian control of children (Zuckerman 
& Oltean, 1959). Mothers of schizophrenics scored higher on the F-Scale 
than did mothers of nonpatients, while the patients and their mothers did 
not differ (Dworin & Wyant, 1957). With a sample of mothers and second
and third grade children, the psychological adjustment of children of 
authoritarian and equalitarian (as measured by the F-Scale) mothers did not
differ (Kent, 1965). However, there were indications of beginnings of differ- 
ences in the personality functioning of the two groups of children. 

Attitudes toward old people were unrelated to F scores (Kogan, 1961),
but authoritarianism was found to be related to scores on scales measuring
open subordination of women and chivalrous attitudes toward women (Nadler
& Marrow, 1959). Budner (1960) found F-Scale scores to be negatively
correlated with attitudes toward socialized medicine in a sample of
medical students. Budner's findings agree with those of Mohler (1953), but
disagree with those of Libo (1957).

Among other studies, Kidd and Riviore (1965) found a curvilinear
relationship between F-Scale and acceptance of fantasy scores. Eliasberg and
Steward (1961) found authoritarianism to be highly related (r = .445) to
ratings of pornography in art. Jacoby (1967) related authoritarianism to
anxiety and birth order, while Bergum and Lehr (1963) investigated
authoritarianism and vigilance performance and Marcia (1967) investigated
authoritarianism and ego identity. In a study by Kimbrell and Blanchard (1964),
persons discharged as ward attendants in a mental institution had lower 
F-Scale scores than those employed. 

Singer (1961) reported that verbal reinforcement affected scores on 
the F-Scale, and Kinnick and Plattor (1967) reported that an eleven-week 
institute dealing with the problems of school desegregation not only brought 
about greater tolerance and acceptance of negroes and desegregation, but 
resulted in the expression of less authoritarian attitudes. Heist (1960) 
found that students attending colleges ranked high on a productivity cri- 
terion had lower mean F-Scale scores than those attending colleges ranked 
low. Also, a significant, but low, positive correlation has been found 
between F-Scale scores and preference for television lectures in a general 
social science course (Janes, 1964). 

Alper, Levin, and Klien (1964) reported findings supporting the notion
that high authoritarians tend to be moralistic (as did Smith, 1965), but not
supporting the idea that low authoritarians tend to be humanistic.
(Interestingly, Eisenman, 1968, has reported that instruction about the
authoritarian personality in introductory psychology courses significantly reduced the
severity of the subjects' moral judgements.) Williams (1963) found high F
subjects to be more willing than low authoritarians to send troops to Cuba
to recover a hijacked plane. Trusting and trustworthiness were negatively
correlated with authoritarianism in a study by Deutsch (1960).

A significant correlation (r = .31) was found between rejection by 
peers (high school students) and F scores (Frymier, 1959c). But Eisenberg 
(1966) did not find a significant relationship between depression and 
authoritarianism.

Burgess (1968) reported that girls who later chose atypical vocations 
had significantly lower F-Scale scores in junior high school than either 
boys or girls who later chose typical vocations. Becker (1964) investigated 
the relationship between authoritarianism and choice of a marriage mate.
In a study by Ramirez (1967), Mexican-Americans scored higher on the
F-Scale than middle-class college students. Ramirez also found a
significant positive relationship between F-Scale scores and agreement with Mexican
family values.

Of the great variety of studies utilizing the F-Scale, some are parti- 
cularly relevant to a concern with the effects of selection bias on F-Scale 
scores. A brief review of these studies is in order. 



Age, Education, and Authoritarianism

Despite the common folklore that older people are more conservative 
and authoritarian, researchers have not set up studies specifically to 
investigate the relationships between authoritarianism and age. Any
correlations reported are incidental products of the study of other variables.
This creates some problems of interpretation, as designs are not set up
to get at any age-authoritarianism relationship. Often the result has
been a confounding of age and education. As an example, Bendig and
Hountras (1959) reported that, although age was not a consideration in their
design, graduate students tended to have lower mean F-Scale scores than 
undergraduates (p < .10). 

The picture in regard to the relationship between age and authori- 
tarianism is a confused one. Plant (1965) reported that, with a sample of 
people with varying amounts of college education who had been tested over 
a four-year period using Gough's modified F-Scale, authoritarianism decreased 
with age regardless of education. Kelman and Barclay (1963) reported that 
17 to 19 year olds in their sample had a higher mean F-Scale score than 
those 20 years and older (p < .05). In a study using the Children's Author- 
itarianism Scale with a sample of upper middle class Jewish children ranging 
from nine to thirteen years of age, older children had significantly lower 
authoritarianism scores (Siegman, 1957). Lindgren (1962), with a sample 
of 150 laundromat customers, reported a nonsignificant correlation between 
scores on a seven-item F-Scale and age. Jones and Gaier (1953) reported a
nonsignificant correlation between age and a combined E-Scale and F-Scale score
average for teachers, pre-teachers, and non-teachers at the University of
Illinois.

On the other hand, Wilcox (1957) found older teachers, principals, and 
supervisors to be more authoritarian. Gregory (1957), with a sample ranging 
in age from 21 to 71, found a positive correlation (r = .52) between F- 
Scale scores and age. (Perhaps restrictions in age range have been the 
cause of some nonsignificant correlations.) However, in Gregory's sample 
the subjects from 21 to 30 years of age were almost all from college groups; 
the older subjects were from church groups. Therefore, age was confounded 
with college and church attendance. A significantly higher correlation 
between scores on a 25-item F-Scale and anti-Semitism scores was reported
for an older group in a Dutch sample (Weima, 1965). 

Generally, authoritarianism scores appear to decrease with education, 
although Lindgren (1962) found a nonsignificant relationship between
education and authoritarianism. Interestingly, Greenberg, Marvin, and Bivins
(1959) reported that high school seniors in West Texas who planned on
attending college were significantly less authoritarian than those not planning
to do so. Eisenberg (1966) reported a decrease in F scores accompanying 
education with a sample of psychiatric and normal subjects. Plant (1966)
found a decrease over a two-year (freshman to sophomore) period at San Jose 
State College. Fox (1965), using his own measure of authoritarianism, 
reported that seniors at a Catholic college had significantly lower scores 
than freshmen. Also, Cohn and Carsch (1954) found that mean F scores 
decreased with level of education in their sample of German workers. To 
what extent these results are due to education and to what extent they are 
due to experimental mortality is not clear. However, a finding by Trent 
and Craise (1967) indicates that education is a potent factor. They used 
a Nonauthoritarianism Scale based on the F-Scale. There were marked and 
significant increases (in autonomy) for subjects consistently in college
for four years, with marked decreases for those consistently employed during 
the same period. 



Sex and Authoritarianism 

As with age and education, the relationship between sex and authori- 
tarianism is not clear cut. In studies using younger subjects, there 
appears to be no difference between females and males. This has been the 
case with junior high school students (Burgess, 1968), high school students 
(Greenberg, Marvin & Bivins, 1959), a mixed high school and college sample 
(Greenberg & Fare, 1959), and with 9 to 13 year olds using the Children's 
Authoritarianism Scale (Siegman, 1957). However, Livson and Nichols (1957) 
found that there was little similarity in inter-item patterns for
adolescent males and females, even though the mean scores were not significantly
different.

At the same time, Bendig and Hountras (1959) reported that with a 
sample of college students, males had significantly lower (p < .05) F- 
Scale scores than females. With a sample of 282 negro college freshmen, 
females also had significantly higher (p < .001) authoritarianism scores 
(Kelman & Barclay, 1963). Wilcox (1957) reported that for his sample of 
teachers, principals, and supervisors, women were more authoritarian. Lambert 
(1960) found women to have higher scores on the average in a sample of
principals and teachers, but apparently the trend was not significant. Perez
(1966) did not find female teachers to be significantly higher than male 
teachers on the F-Scale, nor did Jones and Gaier (1953) find a significant 
sex difference. However, McGee's (1955) male teachers had a significantly 
higher mean F-Scale score than the female teachers, as did the male educa- 
tion students in Bendig and Hountras' (1959) sample. Although Kates (1959) 
reported a higher mean score on the F-Scale for males, it is not clear what 
the age level of his sample was. Nor was significance reported. To add to 
the confused findings, Denmark and Diggory (1966) reported that, in reacting 
to questionnaire statements based on discussions of the authoritarian person- 
ality, men approved authoritarian leadership more than did women. 



Intelligence and Authoritarianism 

In the reviews by Titus and Hollander (1957) and Christie and Cook 
(1958), a general negative correlation between F-Scale scores and
intelligence was noted. Research since 1956 has supported this generalization.
Siegman (1957) found low scorers on the Children's Authoritarianism Scale 
to have significantly higher (p < .01) verbal intelligence scores. Jacobsen 
and Rettig (1959) reported a negative correlation (r = -.393, p < .001)
between F-Scale and total American Council on Education Psychological 
Examination scores (r = -.224 for quantitative and -.262 for verbal scores). 
Scores on the Abstract Reasoning Test of the Differential Aptitude Tests 
were negatively correlated (p < .025) with F scores for a group of sopho- 
mores from Yeshiva College (Klein, 1966). And, a negative correlation 
(r = -.24, p < .05) between authoritarianism scores and scores on the Naval 
General Classification Test was reported for a group of naval enlisted men 
(Davids & Eriksen, 1957). 



Geographic Location, College, and Authoritarianism

As samples of subjects are drawn from different parts of the country, it 
is tempting to attribute differences among them to the factor of geographic 
location. Especially is this true when one is dealing with colleges located 
in rather distinct geographical areas. Undoubtedly, it is sounder to think 
of any variations as related to, if not due to, the subcultural variability 
from one area of the country to another which may or may not bear a func- 
tional relationship to geography. The importance of cultural variables is, 
for example, emphasized by the results, reviewed earlier, of comparing the 
F-Scale scores of negroes and whites. Some geographic areas do have a heavy 
concentration of negroes, and this might have serious implications for sam- 
pling if responses to the F-Scale were of concern. 

Titus and Hollander (1957, p. 59) noted some studies dealing with "var- 
iation within the United States." Christie and Garcia (1951) found that a 
group of students at a university in a "southwestern city" were more
acceptant of F-Scale items than a group of students from the University of
California. In another study (Bass, McGehee, Hawkins, Young, & Gebel, 1953),
southern students were found to have about the same mean scores as the Cali- 
fornia group, but a more restricted range. Using a sample of male students 
from the City College of New York, Davidson and Kruglov (1953) found birth- 
place and parents' occupation to be unrelated to F scores. 

Since the Titus and Hollander review, other studies have investigated 
F-Scale variability within the United States. Frymier (1959a, 1960a),
working with high school students, has found significant differences, in 
descending order, among the F-Scale means of subjects from Alabama, Flor- 
ida, and Michigan. 

A number of studies have used college students as samples. (A summary
of available means is presented in Table 1.) Kelman and Barclay (1963)
investigated various factors related to the F-Scale scores of freshmen at 
a negro college in Maryland. They found a significant difference between 
negroes born in the South and those born in Maryland and other border 
states. uex (1959) referred to a study by Stott (1954) in which students
from the University of Utah were found to be more authoritarian than the 
students from the Universities of California and Oregon reported in earlier 
studies (Adorno et al., 1950; Christie & Garcia, 1951). Greenberg and 
Fare (1959) investigated a number of factors which might be related to the 
authoritarianism of 2079 high school and college students from the east, 
midwest, and southwest. They found a significant difference in mean F 
scores for subjects grouped by section of country (eastern college students 
had a lower mean score than did southwestern college students), race, college 
major, and degree of blindness. In another study (Wrightson et al., 1961), 
southern college students tended to have higher mean scores on the F-Scale, 
but regional differences were not clear cut. 

Gaier and Bass (1959) investigated the correlations between the F-Scale 
and the E and Social Acquiescence Scales with students from Washburn
University in Kansas, the University of Maryland, and Louisiana State University.
Correlations were similar for the Maryland and Kansas samples, but consider- 
ably lower for the L.S.U. sample. As is so often the case, no report of 
sampling technique was given, leading one to wonder if sample differences 
might be due to sampling biases unrelated to the subcultures supposedly 
represented by the samples. Moreover, the general failure to investigate 
the characteristics upon which the various college samples might differ, such
as religious affiliation, makes the findings of research into subcultural
variations difficult to interpret. 



TABLE 1

Summary of F-Scale Means and Standard Deviations
From Previous Studies Using College Students

Author(s)                    University                        Mean   Standard
                                                                      Deviation

Adorno et al. (1950)         U. of Oregon & U. of California   3.51   .92

Christie & Garcia (1951)     U. of California                  3.30   .83
                             Southwest City U.                 4.10   .77

Stott (1954)                 U. of Utah                        4.07   .79

Kelman & Barclay (1963)      Maryland negro college            4.54   .84

Kerlinger & Rokeach (1966)   Michigan State U.                 3.49   .63
                             Louisiana State U.                3.57   .69
                             New York U. (Div. Gen. Educ.)     3.66   .79
                             Combined                          3.56   .70


Religion and Authoritarianism 

Religious belief, as an indication of social conventionalism and moralism,
was a concern in the construction of the F-Scale (Adorno et al., 1950,
Ch. 7). Allport (1964) has discussed the possible relationships between
authoritarianism, conformity, religion, and prejudice, as has Rokeach (1965). 
O'Neil and Levinson (1954) concluded that the E-, F-, Religious
Conventionalism, and Traditional Ideology Scales consisted of overlapping factors.

A relationship between F-Scale scores and religious belief and affiliation 
has continued to be borne out by research. 

Titus and Hollander's review (1957) included only one study dealing 
with authoritarianism and religion. Levinson and Schermerhorn (1951) found 
authoritarianism to be greater among Catholics than among "conventional"
Protestants, and greater among the latter than among "humanistic" Protestants
(Unitarians and Congregationalists) and Jews. Christie and Cook (1958)
cited only two related studies. Sarnoff (1951) found relationships between 
religious affiliation and attendance and his Scale of Jewish Authoritarianism. 
Interestingly, "both those who rejected all Jewish affiliations and those 
with the most complete Jewish identification tended to reject the authori- 
tarian items" (Christie & Cook, 1958, p. 186). A study of the relationship 
of F-Scale scores to agreement with the position of the Catholic church 
(O'Reilly & O'Reilly, 1954) was also cited.

A variety of findings has since been reported. Only a few studies have
investigated directly the relationship of church affiliation with author- 
itarianism. Rhodes (1960), with a sample of 1027 seniors from 8 Tennessee 
high schools, found that authoritarianism varied directly with the Protestant 
fundamentalism of his subjects. He also found that F-Scale scores were not 
independent of social status or the place of residence (rural or urban) of 
the students. There was greater variation among the F scores of Protestants 
than there was between Protestants and Catholics. Significant differences 
among mean F scores for groups of high school students from Alabama, Florida,
and Michigan (in that order) (Frymier, 1959b) likely reflected religious
differences. The Alabama group was 81% Baptist; the Florida group
was 49% Baptist, 40% other Protestants, and 6% Catholic; and the Michigan
group was 16% Catholic and 83% other Protestant. Wrightson et al. (1961)
reported that in their sample of college students from 9 colleges throughout 
the country, Jews and nonchurch members had lower mean F scores. (They also 
tended to be more cynical towards the motives of others.) 

Reported church attendance has been the concern of some studies. Some 
studies have reported that church attendance and authoritarianism are not 
significantly related (Jones & Gaier, 1953; Frymier, 1959b). On the other
hand, Remmers and Steinberg (1954) found a significant, if low, relationship 
(r = .19, p < .01), and Siegman (1957) discovered a curvilinear relationship 
between strictness of religious observance and Children's Authoritarianism 
Scale scores for his Jewish sample. 

Several investigators have correlated F-Scale scores with scores on 
scales of religious belief. The Allport-Vernon Study of Values was used in
two studies. Newsome and Gentry (1963) failed to find a significant rela- 
tionship between school superintendents' scores on the Webster, Sanford, 
Freedman version of the F-Scale and the religious subscale of the
Allport-Vernon instrument. However, Nolan (1963), working with high school seniors
in a southwestern city and using the standard F-Scale, found a low but 
significant negative relationship (r = -.163, p < .05). The relationship 
was higher for males (r = -.340) than for females (r = .023). Gregory 
(1957) reported a correlation coefficient of .53 between F scores and 
scores on a test purported to measure orthodoxy or fundamentalism of reli- 
gious attitudes. Correlating F-Scale and Religious Attitude Scale scores 
for a group of male undergraduates resulted in a coefficient of .53 (Sieg- 
man, 1962). And, a significant correlation (r = .161, p < .05) was found 
between F scores and scores on a Scale of Attitude toward Social Institu- 
tions (including the church) (Remmers & Steinberg, 1954). 



Authoritarianism and Teacher Education 

It has been argued (e.g., Friedenberg, 1959) that selective factors 
operate to attract into the teaching profession people who are conceptually 
"timid and constrained." There are some findings which support the plausi- 
bility of the argument. Friedenberg (1959) cited Travers and others (1953) 
as finding a tendency on the part of their sample of student teachers to be 
rigid when faced with unexpected situations and to lack "high level organi- 
zational ability." Bay (1967, pp. 82-83) referred to a study by Selvin 
and Hagstrom (1965) which found education students at the Berkeley campus 
of the University of California to be less libertarian than social science
and humanities majors by a wide margin. A study by Adams, Blood, and Taylor
(1959) indicated that college students in teacher education are more likely
to be "docile" than are arts and sciences majors. Trent and Craise (1967)
found education students in a sample from Pennsylvania, the Midwest, and 
California to be lower than liberal arts students on the Autonomy Scale 
of the Omnibus Personality Inventory. 

Several studies using the Edwards Personal Preference Schedule (Jackson
& Guba, 1957; Johnson, 1959; Merrill, 1960) have generally found education 
students and teachers to be higher on the Deference Scale and more orderly, 
abasing, and less autonomous than the liberal arts students used for stan- 
dardizing the test. Cook, LeBold, and Linden (1963) carried out a factor 
analysis of the Edwards Personal Preference Schedule and the Guilford-Zimmerman
Temperament Survey with a group of engineering and education students.
They obtained a factor labeled "authoritarianism" with their education 
students which did not emerge with the engineering students. 

Interestingly, there has been little reported research dealing with 
differences between education and other college majors on the F-Scale, and 
the few reported results do not follow the trend indicated by the non-F- 
Scale studies. Jones and Gaier (1953) found nonteachers at the University 
of Illinois to have significantly higher F scores than preteachers, with no 
difference between nonteachers and teachers. Unfortunately, few reports of
research using the F-Scale report means and standard deviations for college 
major subgroups. This makes it impossible to compare new data on education 
students, such as from the present study, to that from previous studies. 

If little has been done in comparing the F-Scale scores of education 
and other majors, how about research comparing education students or prac- 
ticing teachers with different college subject majors? Again, little 
research is available. Comparisons of different teaching majors have not 
been reported. However, some comparisons of elementary and secondary school 
teachers are in the literature. Jones and Gaier (1953) and McGee (1955) 
reported no differences in mean F-Scale scores between elementary and 
secondary teachers, although the first study found that teachers who had 
taught at both levels scored significantly higher on the scale. In one 
study (Wilcox, 1957), the rank order of elementary teachers' mean F-Scale 
score was higher than for secondary teachers, but no test of significance 
was reported. The lack of research may be due to the lack of important 
implications of authoritarianism for the teacher's classroom role. However, 
a number of studies suggest that such is not the case. 



Authoritarianism and Attitudes Toward Children 

Would the relative authoritarianism of teachers as compared to other college 
majors or among students in different teaching majors have any serious 
implications for teacher education? Certainly teacher educators are 
concerned with the attitudes toward children which teachers take into the
classroom. The ability to empathize with students, to put one's self in
their place, would appear to be an important basis for positive attitudes
toward students.

A number of studies have dealt with the relationship between authoritarianism
and the ability to estimate the F-Scale scores of others, one type
of empathetic behavior. Scodel and Mussen (1953) reported that subjects 
with high F scores showed a greater tendency to assign other people F-Scale 
scores like their own. High F scorers also were less accurate than were
low F subjects in judging the authoritarianism of others. Crockett and 
Meidenger (1956), Kates (1959), and Simon (1966) also found that persons 
with high F-Scale scores showed a greater tendency to assume that others 
had similar scores, although in Simon's study authoritarianism was not 
related to the capacity to estimate its presence in other subjects. Not 
surprising, given the tendency of high F subjects to assign to others scores 
similar to their own, was the finding (Scodel & Freedman, 1956) that high 
F scorers were more accurate in estimating each other's F-Scale scores than 
were low F scorers. Frymier (1960b) discovered that prospective teachers 
erred rather seriously in predicting adolescent responses on many F-Scale 
items, and that a group of experienced teachers did no better (Frymier, 1962). 

Rather than having his subjects estimate the authoritarianism of others,
Bookbinder (1963) had them rate the personality traits of men and women in 
photographs. With this variation in task, he found no differences between 
high and low F scorers. Hart and Brown (1967) used the F-Scale as one of 
three tests for establishing three groups of student teachers with differing 
"levels of dogmatism." Although those subjects low in "dogmatism" were 
consistently more accurate in their judgements of their secondary school 
students, there was not a significant difference in accuracy of judgement 
among the three groups. 

There is some reason to suspect that authoritarian teachers may have 
difficulty estimating the authoritarianism, if not the other personality 
characteristics, of their students. But, is there any data to suggest that 
attitudes toward children may be associated with authoritarianism? Lindgren 
(1962), with a sample of laundromat customers, found that scores on seven 
items from the F-Scale correlated negatively (r = -.28) with a measure of 
child centeredness. One group of investigators (Johnson, Johnson, & Lea,
1961) reported that the higher the F-Scale scores of their subjects, the
more rigid the attitudes toward sex-appropriate behaviors for children.
In a study utilizing 126 mothers and their two-and-one-half-year-old children
(Hart, 1957), a positive correlation (r = .63) was found between authoritarianism
and nonlove for children scores. On the other hand, Gallagher
(1957) reported a nonsignificant correlation between scores on the F-Scale 
and the Children's Attitude Scale, even though high F scorers did tend to 
score higher on the CAS. This divergent finding may be explained by the 
sample— 59 social science majors in a child psychology course. Scores are 
likely to have been quite homogeneous on both scales, making it less likely 
that a significant correlation would emerge. 
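
The attenuation argument above can be illustrated with a short calculation. The sketch below is not part of the studies reviewed; it assumes a linear relationship with constant residual variance and restriction on only one of the two variables, and the function name and the numerical values are hypothetical, chosen only for illustration.

```python
import math


def restricted_r(r_full: float, sd_ratio: float) -> float:
    """Correlation expected in a range-restricted (homogeneous) sample.

    r_full   -- correlation in the unrestricted population (hypothetical value)
    sd_ratio -- restricted SD divided by unrestricted SD on the variable
                whose range is narrowed (0 < sd_ratio <= 1)

    Assumes a linear regression with the same slope and residual variance
    in both the full and the restricted group.
    """
    if not -1.0 <= r_full <= 1.0:
        raise ValueError("r_full must lie between -1 and 1")
    if not 0.0 < sd_ratio <= 1.0:
        raise ValueError("sd_ratio must lie in (0, 1]")
    shrunk = r_full * sd_ratio
    return shrunk / math.sqrt(1.0 - r_full ** 2 + shrunk ** 2)


if __name__ == "__main__":
    # Hypothetical case: a population correlation of .50 observed in a
    # sample whose spread is half that of the population.
    print(round(restricted_r(0.50, 0.5), 2))
```

Under these assumptions, a population correlation of .50 would shrink to roughly .28 in a sample with half the population's spread, which is the kind of reduction that could keep a correlation from reaching significance with only 59 subjects.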



Authoritarianism and Attitudes Toward Discipline 

There is some reason to expect that the higher their authoritarianism, 
the more likely it is that teachers will be punitive in their attitudes 
toward children's disciplinary failures. Elms and Milgram's (1966) high
F scorers were more willing to administer electrical shocks than were their
low F subjects. Cuthbert (1967) reported that high authoritarians tended to
assign more responsibility to the wrongdoer in a story situation than did 
low authoritarians. And, Sherwood (1966) found that F-Scale scores differ- 
entiated between persons who expressed differing degrees of concern with 
blame and punishment after President Kennedy's assassination. He concluded 
that authoritarian adults have conceptions of morality and justice similar 
to the notion of "moral realism" which Piaget found in children from three 
to eight years old. Of particular interest vis-à-vis possible punitiveness
toward children is the finding by Roberts and Jessor (1958) that although
high and low authoritarians did not differ on extrapunitiveness independently
of the status of "frustrators" in pictures, high authoritarians did tend to 
respond with personal hostility toward low status frustrators and with in- 
direct or displaced hostility toward high status frustrators. 

In line with the above findings are those by Desoto, Kuethe, and Wunderlich
(1960) that high F-Scale scorers exhibited greater fear and suspicion,
and by Deutsch (1960) that subjects classified as suspicious-untrustworthy
had significantly higher F-Scale scores than those classified as
trusting-trustworthy.

In addition, Hoffman (1963) found significant correlations between F- 
Scale scores and power assertion over their children for middle-class mothers 
and lower-class fathers. Zuckerman and Oltean (1959) found F-Scale scores 
to be related to inventory scores indicating favorable attitudes toward authoritarian
control by parents (r = .51 for psychiatric patients, r = .61 for
nurses). Kates and Diab (1955), using University of Oklahoma students, came
up with similar findings. McCandless (1961, pp. 385-390) reviewed authoritarianism
and child-rearing practices. He concluded that research suggests
a relationship between the harshness of child-rearing practices espoused and 
authoritarianism, especially where the individuals have direct personal in- 
volvement with children — as is the case with teachers. 



Authoritarianism and Classroom Behavior 

It seems clear that authoritarianism is related to other measures which 
have serious implications for the way in which teachers might behave in the 
classroom. In addition to the findings cited above, authoritarianism scores 
have consistently shown a high negative relationship (r = -.46 to r = [illegible])
with scores on the Minnesota Teacher Attitude Inventory (Del Popolo, 1960;
Ofchus, 1963; Vacchiano, Schiffman, & Crowell, 1966) which is supposed to 
measure attitudes which predict how well the teacher will get along in his 
interpersonal relationships with students. 

In related fields, Eager and Smith (1952) found significant correlations
between camp counselors' scores on Sanford's modified F-Scale and children's
selections of authoritarian items for the counselors on a "Guess Who" scale
(r = .53 for boys, .33 for girls). In industry, Mowry (1957) reported that
the F-Scale correlated with ratings of supervisors' success (r = .49 with
N = 20).

Several studies using educators have involved indirect measures of 
behavior. Perez (1966) found an inverse relationship between authoritarianism
and teachers' scores on a scale designed to measure disposition toward teamwork.
In a study by Wilcox (1957), scores on a scale measuring conservative
role orientation correlated (r = .74) with teachers' and administrators' scores
on the F-Scale. Titus and Hollander (1957) reviewed a study (Juul, 1953)
in which equalitarian teaching attitudes of college seniors, as measured by
a "How I Teach" inventory, were negatively related to F scores. Rowan
(1963) found that scores on a limited projective test--the Teacher-Pupil 
Problem Situation Inventory--were associated with F-Scale scores. Hines 
(1956) reported discrepant findings. When principals and five teachers in 
each of 75 schools filled out a Principal Behavior Checklist which was 
later scored for authoritarian and democratic responses, neither score 
correlated significantly with principals' authoritarianism. Hines did find 
that F-Scale scores correlated significantly (r = -.38) with frequency 
of effective practices as judged by 10 education professors with adminis- 
trative experience, but this finding probably tells us more about the 
orientations of the professors than about the principals' behavior. 

Given the early reports by H. H. Anderson and his colleagues (Anderson 
& Brewer, 1946) that dominating behavior by teachers begets dominating beha- 
vior by students, it is not surprising that teachers' F-Scale scores have 
been found to bear a positive relationship (r = .63) to the number of social 
isolates in their classrooms (Gold, 1962). Nevertheless, attempts to relate 
authoritarianism directly to on-the-job performance by educators have met 
with mixed success, even though differences in leadership behaviors by high
and low F scorers were established early in the laboratory (e.g., Haythorne,
Couch, Langham, & Center, 1956). McGee (1955), in a study of classroom 
behaviors such as dominative-integrative and insensitive-sensitive, found
a correlation of .58 between F-Scale scores and his observational estimates 
of authoritarian classroom behavior. Del Popolo (1960) also found scores 
on the Webster, Sanford, Freedman adaptation of the F-Scale to be negatively 
correlated (r = -.62) with frequencies on an Observation Checklist for Student
Teachers. On the other hand, Ends (1966) reported that although F-Scale
scores correlated with a measure of teachers' thoughts about the
classroom, they did not correlate with classroom behavior. And Liu (1964) 
found little support for the supposition that the personal characteristics 
of student teachers (including as a variable the Webster, Sanford, Freedman 
Authoritarianism Scale) are related to teaching behavior. However, the 
only two significant relationships involved authoritarianism. Liu concluded 
that the less rigid, dominating, dogmatic, and/or autocratic the student 
teacher, the more alert, responsible, confident, and/or self-initiating his 
students were likely to be. 



Authoritarianism and Thinking

The assumption is commonly made that if teachers are to teach children 
more effective modes of thought, they must be able to serve as effective 
models of thinking themselves. The possibility that high authoritarians are
selected into teaching raises some interesting questions vis-à-vis the
teacher's role as a teacher of thinking. In an earlier section of this
review, note was made of the consistent findings that authoritarianism is
correlated negatively with scores on common tests of intelligence. Di Vesta
(1961), in his review of research in learning, discussed the tendency of authoritarians
to be concrete rather than abstract and to come to conclusions without
testing hypotheses. In addition, Tuckman (1965), working with a group
of 208 soldiers, found F-Scale scores to be negatively related to the complex- 
ity and openness of integration of belief systems. Going beyond intellectual 
modes of thought to attitudes of mind especially relevant to the social 
studies, authoritarianism appears to be negatively related to world mindedness 
(Smith & Rosen, 1958). There also appears to be a low positive relationship 
between F scores and conservatism in risk-taking (Baron, 1968). And, logical
consistency, at least in regard to ideas about education, has been
found to be negatively correlated with the F-Scale scores of superintendents
(Gowen, Newsome, & Chandler, 1961; Newsome & Gentry, 1963) and part-time
college faculty (Gowen et al., 1961).



Authoritarianism and the Methods Course 

The general relevance of authoritarianism to teacher education is of 
interest for that part of the research presented in this report which deals 
with the characteristics of a nationwide sample of students in education 
courses. The more general review also provides a background for looking 
at the research which has implications for the reactions of students to a 
social studies methods course which departed from the traditional pattern 
of lecture and concern with planning lessons and units. 



Authoritarianism and Evaluations of the Instructor. Past research 
makes it difficult to predict how authoritarians might react to the instructor 
of a methods course which not only departed from traditional conceptions of 
the course, but was taught in an open-ended way. Obviously, the open- 
endedness of the course might evoke negative reactions in students low in 
tolerance of ambiguity. Yet the findings in research relating F-Scale scores 
to tolerance of ambiguity have been equivocal, despite a theoretical expec- 
tation that authoritarians would be more demanding of certainty. Part of 
the difficulty has been the differing experimental definitions of ambiguity 
(Titus & Hollander, 1957), ranging from reactions to lights in a darkened 
room (Milton, 1957) to the number of blanks not completed in cloze passages 
(Honigfeld, Platz, & Gillis, 1964). 

Studies in leadership are also of little help because of their incon- 
clusive nature (Titus & Hollander, 1957). There is some evidence that high 
F scorers do prefer autocratic leadership in the Navy (Jones, 1954) as well 
as leader-centered discussions in university classes (Harmon, 1955). There 
are also findings which indicate that high authoritarians tend to be less 
critical than low authoritarians in their ratings of military leaders (Jones, 
1954), stimulus persons in experiments (Kates, 1959), and parents (Phares, 
1960). However, Maney (1959) found students’ F-Scale scores were not signi- 
ficantly correlated with their ratings of 8 instructors in a Southern women's 
college. And, Sheffield and Byrne (1967) reported that authoritarianism did 
not have a significant effect upon judgements of the attractiveness of 
strangers varying in degree of similarity to the subjects' own responses to 
an attitude scale. Nor was the interaction between authoritarianism and 
similarity-dissimilarity significant.

Among other things, low F scorers seem to be more able to sort out the
person's personal qualities from his performance in the role being judged.

While it seems reasonable to expect that authoritarians would react negatively 
to an instructor who is not a lecturer-authority figure and to a course which 
is oriented toward opening and exploring issues of concern in social studies 
education, there may also be a tendency for the authoritarians to be uncritical
in their judgement of any professional authority figure. F-Scale
scores are, therefore, not likely to be related to evaluations of the instructor
in a modified methods course. 



Authoritarianism and Learning. As already noted, F-Scale scores have 
generally been found to correlate negatively with intelligence scores. It 
would not be surprising, then, to find authoritarianism to be inversely related 
to learning. Stott (1954) reported a negative correlation between F-Scale 
scores and predicted grade point averages for University of Utah students, 
and Davids (1956) reported a correlation coefficient of -.60 for F-Scale 
scores and college grades. Linton (1967) found scholastic achievement on the 
provincial exams in Alberta to be inversely related to F-Scale scores. Oliver 
and Shaver (1963, 1966; Shaver & Oliver, 1968) found that F-Scale scores 
interacted with socratic and recitation teaching in affecting performance in 
an interview scored with categories based on the critical thinking concepts 
taught students in a two-year course. In another study (Neel, 1959), senior 
medical students who were high F scorers had more difficulty than low F 
scorers in learning material dealing with humanitarian philosophy and mat- 
erial judged to be ambiguous. F-Scale scores also were negatively corre- 
lated (r = -.43) with gains on the Minnesota Teacher Attitude Inventory 
during a six-week intensive teacher training course (Vacchiano et al., 1966).

On the other hand, Bohlke (1960) reported that authoritarianism was 
not related to learning about India or to change in attitudes about India 
on the part of 225 college freshmen. Stotland and Patchen (1961) also 
reported the lack of a clear relationship between authoritarianism and the 
reduction of prejudice in an experimental setting. In other studies, F- 
Scale scores were not related to the learning of interviewing skills (Balensky, 
1963) or the recall of nonsense sentences (Pyron & Kafer, 1967). Despite 
these discrepant findings, reviews of authoritarianism and learning (Di Vesta, 
1961; Harvey, 1963; Jackson & Strattner, 1964) have concluded that F-Scale 
scores generally bear a negative relationship to conceptual learning. 



Cross-Cultural Studies of Authoritarianism

One aspect of the research to be presented in this report is the 
comparison of American and German education students. The F-Scale has been 
used in several cross-cultural studies. Melikian (1959) and Diab (1959)
have found Arabs and college students from the Near East to have higher 
mean F-Scale scores than Americans. Niyekawa (1966) reported that Japanese 
high school students have higher mean F scores than American high school and 
college students in her sample. Caution in interpretation must be exercised 
because her Japanese sample is from rural schools and, based on American
findings, may be more authoritarian than an urban sample. Siegman (1962)
found different relationships between F-Scale scores and religiosity for
American and Israeli college students. And, in a particularly relevant
study, a sample of German workers had a mean F score of 5.26, ". . . higher
than the mean score for any group thus far reported in the literature . . ."
(Cohn & Carsch, 1954).

As Gordon and Kikuchi (1966) have noted, cross-cultural findings must 
be interpreted with caution. Even if the translation of items is accurate, 
differences in culture are likely to mean that an item will have different 
meaning from one culture to another. Niyekawa (1966) also warns about 
applying an American definition of authoritarianism to other cultures. At
the same time, if the F-Scale is a valid measure of authoritarianism, it 
should pick up cultural differences. It is interesting, in the light of the 
cultural similarities between the United States and England, that a sample 
of engineering students from the two countries did not have significantly 
different F-Scale means (Peabody, 1961). 



Validity 

It may seem strange that discussion of the validity of the F-Scale has 
been left to the end of the review of research with the measure. However, 
the discussion of the scale's validity seems to be particularly appropriate 
in the context of the number of studies carried out with it. Also, a concern 
for validity makes an excellent transition to the next major section of the 
review dealing with the Dogmatism Scale. 

Obviously, the wide use of the F-Scale noted by Titus and Hollander 
(1957) has continued to the present. Unfortunately, large numbers of studies 
do not insure validity. In fact, the number of equivocal findings raises 
serious questions about the scale's construct validity. Because of the 
tendency for psychological and educational researchers to use whatever 
sample happens to be handy (usually college students in introductory psych- 
ology courses) and because of the differing definitions of the variables 
to which the F-Scale has been related, it is difficult to determine whether 
the inconsistencies in findings are due to the invalidity or lack of sta- 
bility of the scale or to the varying conditions under which it has been 
applied. For example, a finding that high F scorers in a furniture factory 
showed increased productivity with the obvious presence of researchers 
(Rosen & Sales, 1966) held true for workers from urban backgrounds, but not 
for workers from rural backgrounds (Sales & Rosen, 1967). Certainly, one 
of the contributions of the present study will be an indication of the impor- 
tance of using samples that come from different colleges and, therefore, 
from different subcultures. 

It is ironic that one of the consistent findings with the F-Scale-- 
that of negative correlations with measures of intelligence--can be inter- 
preted as raising questions about the validity of the F-Scale as a measure 
of authoritarianism. For as Cohn (1952) has pointed out, the negative rela- 
tion of F-Scale scores to scores on intelligence tests may simply mean that 
more intelligent people are better able to figure out what the F-Scale is 
intended to measure, and so give more acceptable answers. 







There have been methodological criticisms (Hyman & Sheatsley, 1954) 
of the studies (Adorno et al., 1950) from which the F-Scale came and of 
the instrument itself. One of the basic questions raised about the scale 
has been whether or not it is over-saturated with the effects of response 
set. Christie and Cook (1958) have reviewed the literature on response 
set and the F-Scale. Continuing research (Messick & Jackson, 1958; Couch 
& Keniston, 1960; Peabody, 1961, 1966; Berkowitz & Wolkon, 1964; McBride 
& Moran, 1967; Pedersen, 1967) indicates that acquiescence response set is 
a factor in authoritarianism as measured by the F-Scale, probably accounting 
for about 15% of the variance. As Brown (1965, p. 514) recommends, resear- 
chers should seriously consider using F-Scale items balanced for yes and no 
responses. Yet, there seems to be no reason to discount authoritarianism 
scores obtained with the original Adorno et al. (1950) measure. 

The Christie-Cook review (1958) also dealt with factor analytic studies 
of the F-Scale. They concluded that the picture was cloudy, with a lack 
of consistency in the factors which had emerged and in inter-item correla- 
tions from one sample to the next. Since then, Krug (1961) has concluded 
on the basis of his factor analysis of the F-Scale that there is a highly 
reliable general factor, but there are also several multidimensional factors 
which are obscured in the total score. Bendig's (1960) factor analysis 
supports the notion of subtraits in the scale. Krug and Mayer (1961) also 
reported that the subscales of the Edwards Personal Preference Schedule 
and the Guilford-Zimmerman Temperament Survey did not have a great deal of 
overlap with the F-Scale, so seem to measure different factors. At least 
two other factor analytic studies of authoritarianism will become relevant 
as the Dogmatism Scale is discussed, for the D-Scale is a direct outgrowth 
of concern with the validity of the F-Scale. 







III: DOGMATISM--A MEASURE OF GENERAL AUTHORITARIANISM? 



As the researchers whose work finally resulted in the publication of 
the monumental The Authoritarian Personality explored the problems of 
assessing prejudice, it appeared that an indirect measure— one that did not 
involve the mention of any minority groups— would help to alleviate many 
problems of test administration (Adorno et al., 1950, p. 222). The decision 
was to develop a scale to get at "antidemocratic tendencies," and the F- 
(Fascism) Scale (p. 224) was the result. Even with the use of a word (fas- 
cism) which certainly suggests authoritarianism of the political right, 
the F-Scale came to be regarded as a measure of general authoritarianism. 

At least one critique (Christie, 1954) in the review of The Authoritarian 
Personality edited by Christie and Jahoda (1954) was concerned with the 
utility of the F-Scale for detecting non-fascist, left wing authoritarians 
(pp. 130-133). Christie analyzed limited data from The Authoritarian Person- 
ality to conclude that left wingers (Communists and individuals attending 
the California Labor School) obtained low F-Scale scores, probably because 
of the rejection of "ideological cliches representative of the authoritarian 
right" (p. 133). And, he cited earlier studies by Raskin and Cook (1938) 
and Stagner (1936) to support his suppositions. The research of Coulter 
(1953) with English Fascists and Communists, as reported in Brown (1965), 
also adds confirmation to the claim that the F-Scale is a measure of fascist 
or right wing, and not left wing, political tendencies. 

Undoubtedly, the most sustained and productive reaction to the notion 
that some measure other than the F-Scale was needed as a measure of general 
authoritarianism has been that by Milton Rokeach. His concern has resulted 
in the Dogmatism Scale (D-Scale); his book The Open and Closed Mind (1960) 
contains considerable discussion of the scale's validity — especially as 
distinguished from the F-Scale and from cognitive rigidity as measured by 
the Gough-Sanford Rigidity Scale. 

Factor analyses leave little doubt that the F- and D-Scales are heavily 
loaded on the same factors (Ohnmacht, 1967a, 1967b). Moreover, Rokeach 
(1960, p. 121) reported that intercorrelation coefficients for the F- and 
D-Scales had ranged from .54 to .77 in his studies. Other researchers have 
reported a broader range of correlations between the instruments— from .46 
(Vacchiano, Schiffman, & Crowell, 1966) to .82 (Pettigrew, 1958). Although 
high intercorrelations are to be expected if one scale is a general and the 
other scale is a specific measure of authoritarianism, the magnitude of some 
coefficients raises serious questions about the existence of dogmatism, as 
measured by the D-Scale, as a separate variable. 
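
One way to read these coefficients is in terms of shared variance, the square of the correlation. The short Python sketch below is only an illustration: the function name shared_variance is hypothetical, and the coefficients are simply those cited in the paragraph above.

```python
def shared_variance(r: float) -> float:
    """Proportion of variance two measures share, given their correlation r."""
    if not -1.0 <= r <= 1.0:
        raise ValueError("a correlation must lie between -1 and 1")
    return r * r


# F-Scale/D-Scale correlations cited in the text above.
reported = {
    "Vacchiano, Schiffman, & Crowell (1966)": 0.46,
    "Rokeach (1960), lowest": 0.54,
    "Rokeach (1960), highest": 0.77,
    "Pettigrew (1958)": 0.82,
}

for source, r in reported.items():
    print(f"{source}: r = {r:.2f}, shared variance = {shared_variance(r):.0%}")
```

At the top of the reported range (r = .82) the two scales share roughly two-thirds of their variance, which is the sense in which the size of some coefficients raises the question of separateness.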

A factor analysis of the D-Scale by Vacchiano, Schiffman, and Straus 
(1967) indicated that item factors tended to form around Rokeach's (1960) 
definition of D-Scale items. This led the authors to conclude that the 
D-Scale had empirical validity. Another correlational, factor analytic study 
by the same investigators (Vacchiano, Schiffman, & Straus, 1968) has demon- 
strated that the D-Scale is essentially independent of scales taken from the 
16 PF, the Edwards Personal Preference Schedule, the Tennessee Self-Concept 
Scale, and a Machiavellianism Scale. There also was a cluster of correla- 
tions which provided logical support for the concept of dogmatism. Some 
factor analyses have indicated that despite their heavy loading on common 
factors, the D-Scale is factorially discriminable from the F-Scale (Kerlinger 
& Rokeach, 1966) and from the F- and R-Scales (Rokeach & Fruchter, 1956; 
Fruchter, Rokeach, & Novak, 1958). 

Rokeach (1960) also reported data substantiating the claim that author- 
itarians of the right and left will score high on the D-Scale, with author- 
itarians of the left not receiving high F-Scale scores. Barker's studies 
(1963) with New York City graduate students and Ohio State University juniors 
and seniors have lent support to the hypotheses that the F-Scale is biased 
toward authoritarianism of the right and that authoritarians of different 
political leanings will have similar scores on the D-Scale. 

For some reason, response set has not become as salient an issue with 
the D-Scale as it has been with the F-Scale. Studies by McBride and Moran 
(1967) and by Korn and Giddan (1964) have indicated that acquiescent set 
makes contributions to D-Scale scores with positively scored items. Probably 
acquiescent set accounts for no more of the variability with D-Scale scores 
than with F-Scale scores--about 15%. 

Rather than being concerned with the effect of positively scored items, 
some researchers have investigated variations of the scale. Troldahl and 
Powell (1965) reported that a 20 item D-Scale yielded sufficient reliability 
(r = .79 as compared with r = .84 for the 40 item test) to justify its use 
in field studies, and Shupe and Wolfer (1966) concluded that a two-point 
form of item for the D-Scale was just as reliable, and easier to administer, 
than the six-point item. 
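
The comparison of the 20 item and 40 item reliabilities can be put in context with the Spearman-Brown formula, which predicts the reliability of a test whose length is changed by some factor. The sketch below is an illustration only: it assumes the items behave as parallel items (an assumption not made in the studies cited), and the function name spearman_brown is hypothetical.

```python
def spearman_brown(reliability: float, length_factor: float) -> float:
    """Predicted reliability when a test's length is multiplied by length_factor.

    Assumes parallel items; length_factor = 0.5 means the test is halved.
    """
    return (length_factor * reliability) / (1.0 + (length_factor - 1.0) * reliability)


full_form = 0.84   # 40 item reliability reported by Troldahl and Powell (1965)
short_form = 0.79  # 20 item reliability reported in the same study

predicted_short = spearman_brown(full_form, 0.5)
print(f"predicted 20 item reliability: {predicted_short:.2f}")
print(f"reported 20 item reliability:  {short_form:.2f}")
```

Under the parallel-items assumption the predicted value is about .72, so the reported .79 is somewhat better than simple halving of the test would suggest.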

Kemp and Kohler (1965) investigated the suitability of the D-Scale 
for use with high school students, and concluded that reliability coefficients 
were adequate (r = .82, test-retest after 2 months; r = .92 using the Rulon 
formula) to justify the scale's use at that age level. The need to update 
social and cultural items has been of concern (Frandsen, 1967). Also, one 
investigator has reported that subjects can vary their D-Scale scores when 
instructed to do so, with low D scorers being better fakers in the close- 
minded direction (Wolfer, 1967). 

The history of the D-Scale is, of course, much briefer than that of the 
F-Scale. But, like The Authoritarian Personality and the F-Scale, Rokeach's 
The Open and Closed Mind and the D-Scale have stimulated much research. 

Among the many studies, dogmatism has been found to be unrelated to support 
of Goldwater or Johnson, but related to party switching (Zippel & Norman, 
1967), related to reactions to the movie Dr. Strangelove (Roseman, 1967), 
nationalism (Terhune, 1963), time orientation (Zurcher, Willis, Ikard, & 
Dohme, 1967), binocular fusion of stimuli (Iverson & Schwab, 1967), recall of 
nonsense sentences (Pyron, 1967), attitude change during the preparation of 
belief-discrepant communications (Hunt & Miller, 1968), the number of personal 
problems and the reduction of problems through counseling (Kemp, 1961), and 
attitude toward educational media, but not to use of graphics in teaching 
(Hudspeth, 1966). Wrenn (1962) reported that being subjected to an initi- 
ation did not have a differential effect on the evaluation of groups by high 
and low dogmatic female subjects. 

In business-related studies, Kamenshe (1966) found high dogmatics to 
be more resistant to change to a computer system in an insurance company, 
while Druchman's (1967) data suggested that high dogmatics would be more 
resistant to compromise in a bargaining situation. 

Little evidence was found in a study by Nowak (1967) to support the 
claim that dogmatics behave differently in situations in which the perceived 
status of authority figures varies. On the other hand, Vidulich and Karmon 
(1961) reported that dogmatism and the status of an information source 
interacted in a group of female college subjects to significantly affect 
judgement shifts. According to a report by Powell (1962), high dogmatics 
have more difficulty than do low dogmatics in separating message source 
from message content. Becker (1967) found a curvilinear relationship between 
D-Scale scores and dependence on source in judging jokes. McGuckin (1967) 
found only partial support for the hypothesis that subjects would tend to 
indicate more esteem for critics of American foreign policy whose positions 
implied a cognitive type similar to that suggested by the subjects' dogmatism 
scores. In another study of message source, high dogmatics did not show any 
less retention of content presented by a negro, but they did evidence signi- 
ficantly less favorable attitudes toward the negro's message than did low 
dogmatism scorers (Miller & Roberts, 1965). 

Attempts to relate Dogmatism Scale scores to various other personality 
variables have met with varying success. Norman (1966), with a group of 
sophomore female subjects, reported significant correlations for D and MMPI 
scores and between D scores and scores on the Taylor Manifest Anxiety Scale 
and the Barron Ego Strength Scale (the latter two at the .05 level of signi- 
ficance). Yet Zagona and Kelly (1967), using the same anxiety and ego 
strength scales and the D-Scale, found correlation coefficients to be nonsigni- 
ficant with a sample of male and female introductory psychology students. A 
small (r = .083) but significant relationship has been found between D scores 
and self-acceptance in a group of high school students (Pannes, 1963). Korn 
and Giddan (1964) found a negative relationship (r = -.30) between Dogmatism 
Scale scores and the "well-being" score on the California Psychological 
Inventory. (The CPI "tolerance" and "flexibility" scores were also nega- 
tively correlated with D scores.) In a somewhat related study, Cummins (1966) 
found that disciplinary offenders and nondisciplinary students at Michigan 
State University did not have significantly different mean dogmatism scores. 
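
A coefficient as small as r = .083 can be statistically significant only in a large sample, and it implies very little shared variance. The sketch below estimates the sample size at which such a coefficient reaches two-tailed significance, using the Fisher z approximation; the function name is hypothetical, and the result is not a claim about the sample size of the study cited above.

```python
import math
from statistics import NormalDist


def min_n_for_significance(r: float, alpha: float = 0.05) -> int:
    """Smallest n at which |r| is significant (two-tailed), by the Fisher z
    approximation: z = atanh(r) * sqrt(n - 3)."""
    z_crit = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    return math.ceil((z_crit / math.atanh(abs(r))) ** 2 + 3)


r = 0.083  # coefficient attributed to Pannes (1963) in the text
n_needed = min_n_for_significance(r)
print(f"r = {r}: significant at .05 only if n >= {n_needed}")
print(f"shared variance = {r * r:.1%}")
```

By this approximation several hundred subjects are needed, and the two variables share well under 1% of their variance, which is why "small but significant" findings of this kind deserve cautious interpretation.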

Ehrlich and Bauer (1966) reported that, with a group of psychiatric 
patients, dogmatism correlated with work and social impairment, initial 
prognosis, and length of stay. With normal subjects, there are indications 
that dogmatics use sensitization and intellectualization, rather than repres- 
sion or denial, as defense mechanisms (Byrne, Blaylock, & Goldberg, 1966). 
However, dogmatism has been found to be positively associated with denial 
of disability and negatively correlated with depression as a reaction to 
blindness (Hollenbeck & Lundstredt, 1966). De Grada and Ponzo (1967) 
reported that pessimists in their study had higher D scores than optimists. 

In a somewhat different vein of individual differences, the dogmatism 
of counselors has been found to be related to motor inflexibility (Riley & 
Aimlin, 1965), and the association between inhibition of motor response and 
dogmatism has been found to be stronger than between inhibition and socio- 
economic status in seventh grade students (Anderson, 1962). Also, high 
dogmatics had higher galvanic skin response readings than low dogmatics when 
presented with issues with which they strongly agreed, mildly disagreed, or 
strongly disagreed (Snoek, 1967). 



Age, Education, and Dogmatism 

Of the many studies using Rokeach's Dogmatism Scale, some are particu- 
larly relevant to the present study. Where appropriate, the same headings 
will be used to indicate relevant areas of research as were used in the 
review of F-Scale studies. One of the concerns of the present study was 
with the relationship between dogmatism and age. Again, as with the author- 
itarianism review, age and education are considered together because they 
are usually confounded in research studies. 

Several studies utilizing college students have found that dogmatism 
scores reduce during college attendance. These include studies with students 
in Michigan (Lehmann, 1963; Lehmann, Sinhra & Hartnett, 1966), in a New 
England university (Katz & Katz, 1967), and sociology students (Frumkin, 
1961). Foster (1961) reported that there was a decline in dogmatism in a 
Catholic university, but it was less by comparison than in sectarian schools. 
Plant (1965) reported that for his sample, there was a decrease in dogmatism 
with age regardless of education. Plant (1966) has also reported a study 
in which both sorority and non-sorority women decreased in dogmatism scores 
over 2 years, with the sorority women showing a significantly greater (p < .05) 
decrease even though the 2 groups did not differ at the start. Of course, 
a basic problem with these studies using college samples is the lack of 
a control group. It is difficult to tell if education is having an effect 
or if decreases in dogmatism scores are due to experimental mortality. 

Two contradictory findings with college-age subjects have been reported. 
Rabkin (1966) found that D scores and age were not significantly correlated 
in a group of 107 teachers in summer courses at the University of Washington. 
This finding may well be due to a restriction of range in age with a sample 
of teachers. The following finding is not so easy to explain, however. In 
a study involving 104 policemen who had attended college and 122 who had not 
(Smith, Loche, & Walker, 1967), it was found that although the college 
subjects scored significantly lower on the Dogmatism Scale, those in the 
college group with ages from 25 to 29 had significantly higher scores (p < .01) 
than those from 21 to 24. 
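
The restriction-of-range explanation offered above for Rabkin's finding can be illustrated with the standard formula for direct selection on one variable. The sketch below is hypothetical throughout: the unrestricted correlation of -.30 and the standard deviation ratios are invented for illustration and are not taken from any study cited here.

```python
import math


def restricted_r(full_r: float, sd_ratio: float) -> float:
    """Correlation expected after direct selection on one variable.

    sd_ratio is the restricted SD divided by the unrestricted SD of the
    variable on which the sample was selected (here, age).
    """
    u = sd_ratio
    return (full_r * u) / math.sqrt(1.0 - full_r**2 + (full_r**2) * (u**2))


full_r = -0.30  # hypothetical age-dogmatism correlation in an unrestricted group
for ratio in (1.0, 0.75, 0.5, 0.25):
    print(f"SD ratio {ratio:.2f}: expected r = {restricted_r(full_r, ratio):.2f}")
```

As the age range narrows, the expected coefficient shrinks toward zero, so a nonsignificant correlation in a sample of similar ages says little about the relationship in a broader population.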

Research with below-college age subjects has produced negative rela- 
tionships between education-age and dogmatism. This has been true for Ander- 
son (1962) with subjects in grades 8 through 12 and Pannes (1963) with 
seventh through twelfth graders. 

Dogmatism has been found to be negatively correlated with age in 244 
insurance company employees (Kamenshe, 1966), and with age and education 
in psychiatric patients (Ehrlich & Bauer, 1966). With few exceptions, 
then, education seems to be negatively associated with dogmatism, with the 
relationship between dogmatism and age less clear. 



Sex and Dogmatism 

Those studies in which male-female differences on the F-Scale were tested 
for significance have yielded equivocal findings. To this date, reports 
with the D-Scale have presented a somewhat clearer, if slightly clouded, 
picture. A number of studies have reported no significant differences 
between male and female subjects for a variety of groups: teachers attending 
summer school at the University of Washington (Rabkin, 1966), seventh 
through twelfth graders (Pannes, 1963) and eighth through twelfth graders 
(Anderson, 1962), psychiatric patients and therapists (Ehrlich & Bauer, 
1966). In addition, it has been reported (Lehman, Sinhra, & Hartnett, 1966) 
that there was no difference between males and females in the amount of 
reduction of dogmatism scores during college. 

On the other hand, Lehman (1962) found the males, in his sample of 
over 2000 entering freshmen at Michigan State University, to be more dogmatic 
than the females. In another study (Korn & Giddan, 1964), the correlation 
between dogmatism scores and verbal scores on the Scholastic Aptitude Test 
was -.18 for males (p < .01), but -.003 for females. Vacchiano, Schiffman, 
and Straus (1967), in inspecting the factors for males and females emerging 
from their analysis of the items on the D-Scale, found indications that the 
D-Scale measures different dimensions for the two sexes. Alter and White 
(1966) suggested that the significantly higher mean D scores for males in 
their Utah sample might be due to a few items. Along with the possibility 
that males and females may be responding to the items in different contexts, 
the findings on sex differences in dogmatism scores are equivocal. 



Intelligence and Dogmatism 

Rokeach (1960, p. 190) reported that the correlation between scores on 
the Dogmatism Scale and intelligence (as indicated by a "rough measure," 
the American Council on Education Test) was -.02, essentially zero. He 
went on to suggest (Rokeach, 1960, pp. 403-8) that because the D-Scale 
measures a kind of intelligence, the finding of no relationship with scores 
on commonly used intelligence tests calls for a rethinking of current con- 
cepts of the nature of intelligence. Christensen (1963) also found the 
correlation between ACE scores and D scores to be essentially zero. Long 
and Ziller (1965) found verbal scores on the SCAT to be independent of 
dogmatism scores for their sample of freshmen women at the University of 
Delaware, and Uhes (1968) reported a nonsignificant negative relationship 
between Dogmatism Scale scores and scores on the g factor of the General 
Aptitude Test Battery. 

Not all of the data have supported Rokeach's earlier finding, however. 
A negative correlation (r = -.28, p < .05) was found between dogmatism and 
scores on the Ohio State Psychological Examination (Ehrlich, 1961)--although 
Kemp (1962b), apparently referring to the same study by Ehrlich, reported 
the r to be -.001 (p. 11). Pannes (1963) reported a negative relationship 
(r = -.106, p < .01) between dogmatism and "IQ" (the test used was not 
reported). Using the verbal score from the College Qualification Test, 
small but significant relationships with dogmatism scores have been obtained 
(Zagona & Zurcher, 1965; Zurcher, Willis, Ikard, & Dohme, 1967). 
In a study indicating the value of subgroup analyses, Korn and Giddan (1964) 
found dogmatism to be negatively correlated (r = -.20) with verbal scores 
on the Scholastic Aptitude Test, but not with SAT quantitative scores. 
Moreover, using the verbal SAT scores and a measure of dogmatism designed 
to remove the effects of response set and analyzing their data separately 
for males and females (all of whom were freshmen at Stanford), they found 






a significant negative coefficient (r = -.18) for males but a nonsignificant 
relationship (r = -.003) for females. This helps to account for the finding 
reported by Long and Ziller (1965) with the SCAT. Coupled with the diff- 
erent findings for verbal and quantitative scores when that breakdown was 
analyzed, the male-female difference in Korn and Giddan's study suggests 
the extent to which inconsistencies in tests and samples might explain 
divergent research results with the Dogmatism Scale. In any event, although 
some studies report significant dogmatism-intelligence relationships in 
contradistinction to Rokeach's finding, it is clear that the relationship, 
if any, is negative and small. 



Geographic Location, College, and Dogmatism 

Variations in dogmatism scores among students from universities located 
in different parts of the country have not generated the interest that 
they have for the F-Scale. Generally, analyses have not been made 
of the significance of any differences among such groups, although a table 
presented by Alter and White (1966) summarizing data from several studies 
indicates that there are likely to be wide variations in dogmatism means 
from one subculture to another. Girault (1968) found that teachers from the 
South tended to have higher D-Scale scores than those from other regions. 

The other reported analysis (Lehman, 1962a) will be discussed later because 
his groups definitely reflected religious differences. 

Kerlinger and Rokeach (1966) have, for example, reported the means of 
different college groups used in their factorial analysis of the D-Scale 
and the F-Scale. The means are remarkably similar for Michigan State Uni- 
versity, Louisiana State University, and New York University students (149.48, 
148.84, and 145.44 respectively) and no test of significance was reported. 

A mean reported for University of Utah students (149.4) was similar (White 
and Alter, 1967). Rabkin (1966) referred to a series of five means on Ohio 
State University students ranging from 141.3 to 143.8 (reported in Rokeach, 
1960). He found the mean for his 107 teachers (132.3) to be significantly 
lower (p < .05) than those of two of the Ohio State groups. 



Religion and Dogmatism 

Looking at specific identifiers of subcultural differences is likely to 
be more productive than gross comparisons of subjects from different geo- 
graphical locations. If Rokeach's formulation of dogmatism is correct, 
members of fundamentalist religions should have high dogmatism scores 
(accompanying the high authoritarianism scores reported in the section 
of the review on the F-Scale and religion). In addition, however, those 
who are far "to the left of center" in their religious orientations should 
also have relatively high dogmatism scores. 

Rokeach (1960, pp. 109-115) has presented data from two studies which 
he interprets as supporting these expectations. With a group of Michigan 
State University students, he found a decreasing order of mean dogmatism 
scores for Catholics, Protestants, and those professing no religion (means 
equal to 191.1, 180.1, and 174.6 respectively, with the Catholic-Protestant 
and the Catholic-nonbeliever comparisons significant at the .01 level). 
Referring to scores on his Opinionation Scale, Rokeach suggested that all 
three of these groups were actually "right of center" groups. With a 
sample of students from New York colleges, categorized as Catholic, Protes- 
tant, Jew, or nonbeliever, Rokeach found nonbelievers who fell "to the left 
of center." Here the Dogmatism Scale means were, respectively, 147.4, 138.3, 
131.5, and 147.2. Although none of the differences between means is signi- 
ficant at the .05 level, Rokeach interprets the trend as supporting his 
expectation that right and left of center religious groups will both have 
mean dogmatism scores higher than center opinionation groups. 

Cline and Richards (1965) conducted a study in Salt Lake City using a 
modified TAT, along with a depth interview and a Religious Belief-Behavior 
Questionnaire, to gather data on a random sample of 155 adult males and fe- 
males. Factor analysis of their data produced a factor which they labeled 
Dogmatic Authoritarianism. According to the researchers, ". . .this factor 
does not involve any tendency toward conventional or conservative religious 
beliefs" (p. 574). They concluded that their results were consistent with 
Rokeach's formulation, because they suggest that ". . .an unbeliever is just 
as likely to be authoritarian about his unbelief as a believer is to be 
authoritarian about the dogmas of his faith" (p. 574). 

Other studies have supported the general notion that there is a positive 
relationship between dogmatism scores and the extent to which the subject's 
declared religion is to the right of center. Quinn (1964), although reporting 
no means or statistical analysis, reported the following sequence of high 
to low Dogmatism Scale scores for his groups: parochial school Catholics, 

public school Catholics, Jews, Protestants. Lehman (1962b) found with a 
sample of 2,746 entering freshmen at Michigan State University that Catholics 
were more dogmatic than Jews or Protestants, and that students from parochial 
schools were more dogmatic than those from public schools. In another study, 
Lehman (1962a) compared random samples of 50 males and 50 females from a 
Presbyterian liberal arts college, a Congregational liberal arts college, and 
a large state university— all in the Midwest. With both males and females, 
there were significant differences among students from the three colleges 
with the means in the following descending order: Congregational, Presby- 
terian, state university. It is interesting to compare the mean dogmatism 
scores (176.27) of Uhes' (1968) high school students with that (165.75) 
of Kemp and Kohler's (1965) high school sample. Uhes' Utah sample was 
largely Mormon, while Kemp and Kohler's midwestern sample was likely heavily 
Protestant. It is also worth noting that a study with Italian adults 
(Di Renzo, 1967) indicated a significant relationship between degree of 
religious practice and dogmatism. 

Rabkin (1966) working with extremely small subgroups of teachers (80 
Protestant, 6 Catholic, 3 Jewish, and 18 unknown) found no significant 
differences among his religious groups. Given the small number of Catholics 
and Jews and the relatively large group of unknowns, it is difficult to 
give his results much credence. 

Only one Dogmatism Scale study could be found which was related to the 
notion that conservative believers and unbelievers would be equally dogmatic. 
Feather (1967) compared four groups of students: 30 from the fundamentalist 
Evangelical Union, the Newman Society, the Student Christian Movement, and 
10 atheists. The first group was highest in dogmatism and the last group 
was the lowest. Feather's findings contradict directly the Rokeach hypo- 
thesis because atheists would likely be among the strongest of the "non- 
believers"--probably, for example, maintaining a more stringent position 
than those who label themselves "agnostics." 



Dogmatism and Teacher Education 

There is considerable reason to suspect that dogmatism is a variable 
which should be of interest to teacher educators. There is little evidence, 
other than that indirectly available from other personality measures as 
already cited in the earlier review of studies of authoritarianism, that 
those who go into teacher education are more dogmatic than college students 
selecting other fields, even though there is speculation that teachers 
are more dogmatic than other people (Soderbergh, 1964). 

Of some possible relevance is a study by Kemp (1964) in which college 
seniors were tested and then retested six years later using the Kuder 
Vocational Preference Record, the Strong Vocational Interest Blank, and 
the Allport-Vernon Study of Values. Although both high and low dogmatics 
changed value and vocational interest patterns, high dogmatics tended to 
change in the direction of occupations offering more security and more 
opportunity to exercise direction and control, while low dogmatics tended 
to shift toward more interest in social service occupations. While it 
could be argued that teaching is a social service occupation, there is also 
little question that it is an occupation which offers security and the 
chance to exercise direction and control. Friedenberg (1959) used these 
characteristics of teaching to argue that those who become teachers tend 
to be timid and constrained. 

If, indeed, teachers do tend to be dogmatic (a claim not yet estab- 
lished), there would be reason to describe them as constrained. Although 
Leckart and Wagner (1967) found no difference between high and low dogmatics 
in the time spent looking at novel stimuli, there is considerable evidence 
that reactions to novel stimuli are related to D-Scale scores. Mikol (1960) 
found closeminded subjects to be less accepting of new music and a new 
composer than openminded subjects. Zagona and Kelly (1966) projected what 
they considered to be a novel film of color designs accompanied by jazz 
music. Although high dogmatics did not indicate any less preference for 
complex and asymmetrical designs, as measured by the Graves Design Judge- 
ment Test, they were less accepting of the movie. Dogmatism has also been 
found to be a significant factor in painting preferences (Frumkin, 1960). 

Kaplan (1963) concluded that high dogmatics are less open to experiences 
after finding that they have lower sensory acuity than low dogmatics. He 
used measures of olfactory, gustatory, tactile, auditory, and visual acuity. 
His low dogmatics had higher scores on five of the six measures, with a 
trend in that direction on the sixth. In addition, the correlation between 
D scores and total sensory scores was -.61. A related corroborating study 
is that by LoSciuti and Hartley (1963). They concluded that low D scorers 
tend to see stimuli from religions other than their own more often than do 
high dogmatics. 

The lack of openness to the novel and different is reflected in Kamenske's 
(1966) finding that high D scorers employed by an insurance company were 
more resistant to a change to a computerized system. And, Childs (196o) 
has reported that in a study of eight school districts in Michigan, a signi- 
ficantly greater proportion of teachers and administrators were "open- 
minded," as measured by the D-Scale, in districts judged to be innovative 
as compared to districts judged to be non-innovative. If innovation and 
change are not only present in current education, but vital to increases in 
educational effectiveness, the findings reviewed above have serious impli- 
cations for the selection of prospective teachers. 



On the other hand, attempts to use the Dogmatism Scale in the educational 
setting have not always produced such clear cut results. Fish (1962), on the 
basis of essay test results with 18 students in a science education course 
for elementary teachers, concluded that low dogmatics are better able to 
describe developmental sequences after reading scientific generalizations 
and so would be better able to direct discovery learning. Girault (1968) 
reported that the dogmatism scores of 414 history teachers correlated (r = 
.31) with scores on an Objective versus Subjective History Scale. (A high 
score on the History Scale is supposed to indicate a tendency to see history 
as a body of concrete, objective, unchanging facts, with the historian a 
recorder-reporter of these facts.) Ohnmacht (1967b) concluded, on the basis 
of a factor analysis using the D-Scale, F-Scale, Closure Flexibility Test, 
Embedded Figures Test, and the Teacher Characteristics Scale (anticipated 
teaching behavior) administered to a sample of 57 male secondary education 
majors, that a high dogmatic-low analytic factor appeared to identify subjects 
who were not likely to be imaginative, stimulating classroom teachers as 
compared to low dogmatic-high analytic and low dogmatic-low analytic subjects. 
James Johnson (1966) reported that the D-Scale scores of student teachers 
at George Peabody were significantly related to their attitudes toward teach- 
ing and the teacher-pupil relationship. Significant, but low, correlations 
were found between dogmatism and ratings of the student teachers by their 
supervising teachers. But no relationship was found between D scores and 
ratings by college supervisors or the student teachers' ratings of their own 
success. Sears (1968) found that his closeminded teachers received more 
traditional scores on a Philosophy Scale, but didn't differ from his low 
dogmatics on certification rank (females did have a significantly lower 
certification rank, suggesting that the males' economic responsibilities 
may have been an important factor) or in participation in professional 
groups. 



Such findings are interesting, but they suffer from the deficiencies 
of so much research in teaching: they are one step removed from actual class- 
room behavior and at least two steps removed from the effects of teacher 
behavior on students. Studies relating dogmatism to teacher behavior are 
scarce. Although Hanny (1967) concluded that high dogmatics can be taught 
to use interaction analysis to control their verbal classroom behavior, 
Ohnmacht (1967a) found that scores on the Dogmatism Scale were not related 
to indirect-direct teacher behavior as measured by Flanders' Interaction 
Analysis. 



The slight usefulness of the Dogmatism Scale in predicting actual school 
behavior on the part of teachers and principals is also indicated by re- 
search into school climate and ratings of one another by school personnel. 
Gordon (1968) found the dogmatism scores of school administrators to be 
positively related to traditional scores on a Philosophy Scale. When related 
to the ratings of administrators by school personnel using the Leader Behav- 
ior Description Questionnaire (LBDQ), the dogmatism scores of administrators 
were negatively related to the Consideration dimension of the LBDQ, but 
showed no significant relationship to the Initiating Structure dimension. 
The dogmatism of the administrators was also not related significantly to 
the holding power of their schools, although there was a trend toward a nega- 
tive correlation. Bentzen (1968) found that the relationships between D- 
Scale scores, Organizational Climate Description Questionnaire (OCDQ) scores, 
and Teacher Power Scale scores varied tremendously from school to school. 
(The range of correlation coefficients for D-Scale and OCDQ scores was -.11 
to .62.) Huesk (1966) carried out a study employing 44 elementary schools, 
714 teachers and 44 principals. His data produced little support for his 
prediction that the complying behavior of dogmatic teachers would result 
in a positive relationship between teachers' D scores and principals' ratings 
of the teachers, with differing behavior resulting in a negative relation- 
ship between dogmatism and ratings by the teachers' peers. Correlation 
coefficients for teachers' D scores and principals' ratings ranged from -.71 
to .67 from school to school. He did find that for 26 schools from which 
he had ratings on a "happy-troubled" scale, there were negative relations 
between the principals' ratings and teachers' D scores for happy schools, 
and positive relationships for troubled schools. 

Croft (1965) found no relationship between principals' dogmatism and 
their ability to judge how they were perceived by their teachers or super- 
intendent. Bentzen (1966) investigated the relationship of teachers' D 
scores to ratings of their congeniality as a co-worker by principals and 
fellow teachers. No significant relationship emerged. The range of 
correlation coefficients for teachers' dogmatism scores and principals' 
ratings was -.706 to .671. In a study looking to the effects of princi- 
pals' dogmatism on ratings, Musella (1967) found that high dogmatic prin- 
cipals tended to pick high dogmatic teachers as effective (Bentzen, 1966, 
found that dogmatic principals tended to give dogmatic teachers lower con- 
geniality ratings), while low dogmatic principals tended to select both 
low and high dogmatic teachers as effective. 

Generally, the findings are equivocal. They indicate that researchers 
must be careful about interpreting correlations between principals' or 
teachers' dogmatism scores and their ratings of or by others, as well as 
correlations between the dogmatism of school personnel and the organiza- 
tional climate of schools, when these correlations are based on only one 
school or district. 
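
The caution above can be illustrated with a rough calculation. The sketch below is not part of any study reviewed here. It assumes a school of about 16 teachers (714 teachers spread over 44 schools, an average used only for illustration) and applies the standard Fisher z approximation to show how far a single-school correlation can stray by chance when the true correlation is zero. The function name and the 1.96 multiplier (roughly a 95 percent interval) are choices made for this example.

```python
import math

def fisher_interval(r, n, z_crit=1.96):
    """Approximate confidence interval for a correlation r based on n pairs.

    Uses the Fisher z transformation; z_crit=1.96 gives roughly 95 percent.
    """
    if n <= 3:
        raise ValueError("n must exceed 3 for the Fisher approximation")
    z = math.atanh(r)                # transform r to the z scale
    se = 1.0 / math.sqrt(n - 3)      # standard error of z
    return math.tanh(z - z_crit * se), math.tanh(z + z_crit * se)

# Hypothetical single school: about 16 teachers, true correlation of zero.
low, high = fisher_interval(0.0, 16)
print(f"n = 16:  {low:.2f} to {high:.2f}")

# Hypothetical pooled sample of 714 teachers, for contrast.
low_all, high_all = fisher_interval(0.0, 714)
print(f"n = 714: {low_all:.2f} to {high_all:.2f}")
```

Under these assumptions the single-school interval is many times wider than the pooled one, which is in keeping with the large school-to-school swings in coefficients reported above.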



Dogmatism and Attitudes Toward Children 

Is the teacher's score on the D-Scale likely to reveal anything about 
his attitudes toward children and the disciplining of children? One area of 
research that might help in answering such a question is that investigating 
the relations between dogmatism and counseling skills, for it is generally 
assumed that an ability to empathize with and understand others is a basic 
prerequisite to successful counseling. No clear relationship between coun- 
seling ability and D scores has emerged, although as one so often finds in 
a review of research in authoritarianism and dogmatism, the plethora of 
criterion measures and of samples used makes any interpretation difficult. 

Saltzman (1967) found a negative relationship between dogmatism and 
regard for others (as measured by the Barrett-Lennard Relationship Inventory) 
for 30 participants in a counseling institute. He also found that medium 
D scorers improved more from the use of programmed materials dealing with 
counselor attitudes, and concluded that the relationship between dogmatism 
and the learning of human relations skills did not seem to be linear. Kemp 
(1962) used Porter's Test of Counselor Attitudes and behavior in actual 
counseling situations to compare students who participated in a counseling 
practicum with a control group who did not. He found no significant changes 
in the control group. Low dogmatics tended to change toward greater per- 
missiveness and understanding, while high dogmatics changed, from the test 
to the counseling situation, toward less understanding. Kemp concluded that 
apparently the high D scorers gave permissive responses on the Porter test 
to agree with the instructor, but this did not indicate an effect on coun- 
seling behavior. 

On the other hand, Rosen (1967) reported that for the 28 participants 
in a counseling institute, the Dogmatism Scale was not among the best four 
predictors (chosen from the Dogmatism Scale, Allport-Vernon Study of Values, 
the Strong Vocational Interest Blank for Men— Social Science, the Edwards 
Personal Preference Scale, the National Education Act Comprehensive Examin- 
ation in Guidance and Counseling, years of teaching experience, years of 
counseling experience, and age of counselor) of any of the criterion vari- 
ables (rating scales— Accurate Empathy Scale, Self-Congruence Scale, Uncon- 
ditional Regard Scale, all combined to a Total Competency Scale; and the 
same scales scored from tape recordings of counseling interviews). Milliken 
and Paterson (1967) generally found no statistically significant differences 
on the D-Scale between good and poor counselors, except when the groups were 
defined on the basis of supervisors' composite rating scores. 

There have been more direct attempts to measure attitudes toward chil- 
dren. When college students were asked to take the D-Scale twice— once for 
themselves and secondly as they thought a "typical" college student would— 
the correlation between the two sets of scores was .43 with the estimated 
D scores higher on the average (Burke, 1966). Low dogmatics were both more 
accurate in their estimations and more variable. Burke took this as an 
indication that dogmatism is negatively related to sensitivity toward others. 
His findings seem to indicate at least a negative relationship with empa- 
thetic ability of one sort. On the other hand, Brumbaugh, Holdt, and Beisel 
(1966) found no relationship between the D scores of student or supervising 
teachers and their ability to predict the others' personality needs as 
measured by Schutz's FIRO. 

Vacchiano, Schiffman and Crowell (1966) found that with a group of 27 
male and 28 female graduate students who lacked prior teaching experience 
or teacher training, the correlation between the Minnesota Teacher Attitude 
Inventory (MTAI) and D-Scale scores was -.40. A nonsignificant correlation 
between D scores and gains on the MTAI during a 6-week intensive teacher 
training course (r = -.19) indicated that dogmatism was not related to 
changes in attitudes toward teacher-student relationships in the classroom. 
The finding of Vacchiano and his associates with the MTAI scores is support- 
ed by Hoy's (1966) report that for 805 teachers and 168 principals in 
Pennsylvania, those who were more closeminded tended to be more custodial 
in their pupil control ideology. 

A study by Rebhun (1967) is also relevant. He found significant rela- 
tionships between the dogmatism scores of male undergraduates and the 
following attitudes as measured by the Schaefer-Bell Parental Attitude 
Research Scale: fostering dependency, seclusiveness, breaking the will, 
harsh punishment, demanding activity, deification of parent, ascendancy of 
husband, suppression of affection. He concluded that dogmatic individuals 
tend to hold parental attitudes which discourage children from intruding 
upon the parents' belief-disbelief systems. If his findings and conclusions 
are applicable to females and to those males going into teaching, the impli- 
cations for the teacher's relations with his pupils could be serious. 

One other study (McCarthy & Johnson, 1962), although tangential, is 
relevant here. It involved the reactions of 157 undergraduates toward 
the demonstrations against the House Un-American Activities Committee's 
hearings in San Francisco in 1960. Authorities claimed that the demonstra- 
tions were communist inspired; the students involved said not. The subjects, 
none of whom were involved in the demonstrations, were asked which account 
they accepted. High dogmatics tended to accept the official version; low 
dogmatics tended to accept the student account; those neutral in opinion 
tended to fall in the middle of the D-Scale range. Perhaps it is not 
stretching one's imagination too far to suggest that dogmatism might be a 
factor in the teacher's willingness to accept his students' versions of 
their apparently deviant behavior. 

In any event, despite the implications which studies such as the above 
have for teacher attitudes, one can only conclude that it is unfortunate 
that there have not been more studies dealing directly with classroom behav- 
ior. It must be recalled that Ohnmacht (1967a) found no relationship 
between teachers' dogmatism scores and their indirect-direct behavior as 
measured by Flanders' Interaction Analysis. 



Dogmatism and Thinking 

If scores on the Dogmatism Scale do indicate the degree of openness of 
one's belief-disbelief systems, this variable might well be of concern to 
educators who believe that one of the functions of public education is 
to prepare students for more effective thinking. Can teachers who are basi- 
cally closeminded in their construing of the world educate students to be 
otherwise? Although one might be inclined to give a negative response to 
the question, there is unfortunately no research evidence bearing directly 
on it. However, with the assumption that the teacher's thinking competen- 
cies would bear some relationship to his instructional effectiveness, 
perhaps due to the model of thought he would provide his students, some 
research findings become relevant. 

The relevant research findings do not, however, provide a firm basis 
for concluding that the dogmatism of teachers [illegible] 
is a crucial variable in education. It will be recalled that the corre- 
lations between F-Scale scores and intelligence test scores have been con- 
sistently significant and negative, even though low. However, findings 
with the D-Scale have been equivocal, this despite a finding (Rebhun, 1966) 
of a positive association (coefficients of about .25) between D scores and 
anxiety (as measured by Sarason's Test Anxiety Scale), leading Rebhun to 
conclude that dogmatism might impair intellectual functioning. Therefore, 
if intelligence test scores are taken to be measures of thinking ability 
or potential, research with intelligence tests gives little assurance that 
the D-Scale is an effective predictor of teachers' thinking ability. 

Other evidence in regard to dogmatism and thinking is available. Kemp, 
in two different studies (1960, 1963), has found high and low dogmatic college 
students to differ significantly in their scores on selected problems in 
critical thinking and on the Watson-Glaser Critical Thinking Appraisal. 
Klein (1966) reported that D scores were negatively related to deductive 
reasoning scores (apparently based on the Watson-Glaser test). 

Taking a somewhat less direct measure of "critical thinking," an inves- 
tigation by Kleck and Wheaton (1967) indicated that although high dogmatic 
high school students did not show significantly greater [words missing] 
information consistent with their opinions, they did recall significantly 
less inconsistent information and they evaluated consistent information 
more favorably than did low dogmatics. Miller and Lobe (1967) concluded 
[illegible] greater attitude change with high dogmatics than with low 
dogmatics. [illegible] 
studies education to teach methods of inquiry rather than just facts, the 
finding of Girault (1968) referred to earlier (that dogmatism is correlated 
with a view of history as concrete and objective, with the historian a 
recorder-reporter of facts) is relevant to a concern with teachers' abili- 
ties to teach for critical thinking. 

All of the evidence is not consistent, however. Rokeach (1960) has 
[lines missing] 
test to screen out "below average" people (to induce a stress condition), 
there was a curvilinear relationship between dogmatism and scores on per- 
[lines missing] partialled out. Johnson concluded that his results imply 
that extreme open-mindedness may be as disruptive as extreme closemindedness. 

Cognitive measures of creativity have shown small negative relationships 
with dogmatism. Jacoby (1967) reported that with a sample of 24 graduate 
students in business education, the correlation between dogmatism scores 
and scores on the Remote Associations Test was -.248 (p = .12). Zagona and 
Zurcher (1965) found significant differences between high and low dogmatics 
in their ability to form remote verbal associations. Uhes (1968), using a 
sample of over 200 high school seniors, found low but significantly negative 
relationships between D scores and several measures of divergent and conver- 
gent thinking. 

The relationships between D scores and thinking ability, measured in 
terms of critical or creative thinking, seem to be negative and low. Kemp 
(1962a) has suggested that the relationship between dogmatism and synthe- 
sizing ability holds true with unfamiliar tasks. Familiar tasks and those 
in which an authority gives hints as to the "proper" synthesis apparently 
do not present the closeminded person with relatively greater difficulty as 
compared to low dogmatics. These qualifications, plus research results 
(Johnson, 1966; Kessler & Konenberber, 1967), raise serious questions about 
the relationship between dogmatism and synthesizing ability as hypothesized 
by Rokeach. The varying research results indicate that the conditions under 
which the relationship is assessed are likely to be of critical importance. 



Dogmatism and The Methods Course 

As in the review of studies using the F-Scale, two aspects of dogmatism 
and students' reactions to the methods course are of interest here: (1) 
the relationships between the students' dogmatism and their ratings of the 
instructor, and (2) the relationship of dogmatism and learning. There was 
not a great deal of research available that was related to the first cate- 
gory, in contrast to the review of authoritarianism research. The review 
to this point indicates that dogmatics may find synthesis more difficult, 
especially under stress. Girault (1968) reported a relatively high (r = .45) 
correlation between D scores and scores on Walk's Ambiguity Scale 
(intolerance of ambiguity). These factors would indicate that high dog- 
matic students would react negatively to a change to a methods course that 
(1) required an analysis of social studies goals, along with a synthesis 
into curricular proposals on the part of the students, and (2) was more 
ambiguous in that examinations involved the discussion of curricular issues 
in essay questions, rather than the testing of specific content through 
objective-type questions. Yet, the studies reviewed earlier, indicating 
conflicting findings when the ratings of school personnel were related to 
dogmatism, make it difficult to conclude from the literature that any par- 
ticular relationship would likely be found between dogmatism and the students' 
ratings of the instructor. 



Dogmatism and Learning. One of the concerns of the present study is 
the relationship between D scores and indices of learning in the methods 
course. In an earlier review, Jackson and Shattner (1964) indicated that 
dogmatism generally correlates negatively with learning. Ohnmacht (1966) 
has found dogmatism to be a factor in concept formation tasks involving a 
reversal of the initial task. Adams and Vidulich (1962) used volunteers 
from an introductory psychology course in an experiment in paired-associate 
learning. Their high dogmatics made significantly more errors with both 
belief congruent and belief incongruent pairs of noun stimuli and adjective 
responses. Moore (1962) found, in a study of the conditioning of the verbal 
behavior of 369 university students, that low dogmatics conditioned signi- 
ficantly better. Kemp (1963) carried out a study somewhat more in line 
with the traditional conception of learning. He found that when 80 univer- 
sity freshmen were given special training in problem solving and tested for 
gains on the Watson-Glaser Critical Thinking Appraisal, the improvement of 
high D scorers was not significant, while the low D scorers showed improve- 
ment significant at the .01 level. 

Frumkin (1961) found that high dogmatics obtained significantly [illegible] 
[lines missing] provoked a series of further research studies, found dogmatism 
to be significantly and inversely related to learning in an introductory 
sociology course, as measured by scores on a true-false test. This relation- 
ship (significant at the [illegible] level) held up with Ohio State Psycho- 
logical Examination scores partialled out. (The relationship between learning 
and OSPE scores was not significant with D scores partialled out.) Christensen 
(1963) replicated Ehrlich's study at another university, but using students 
in an introductory psychology course. He did not find the inverse relationship 
Ehrlich reported (using multiple-choice and essay questions as measures of 
learning). He suggested that his differing results might be due to a more 
rigorous approach in psychology which might be more congruent with the needs 
of the dogmatic person. Costin (1965) attempted to duplicate Ehrlich's 
method while using Christensen's subject matter to determine if method was 
a factor in the different findings. He found a slight (r [illegible]), but 
[illegible] relationship between dogmatism and learning scores, and proposed 
that dogmatism-learning relationships are confounded by different types of 
dogmatism and the type of material being learned. In order to determine if 
test format was a factor in the contradictory Ehrlich-Christensen findings, 
White and Alter (1967) used both true-false and multiple-choice items with 
[number illegible] introductory psychology students at the University of Utah. 
They found that for the 14 classes involved there were 12 negative and 2 
positive correlations between D scores and learning scores, with an overall 
correlation of -.18 (p < .01). They found no difference between the results 
obtained with the true-false and with the multiple-choice items. A study 
conducted at the University of Arizona with 517 introductory psychology 
students (Zagona & Zurcher, 1965) also produced a low (r = -.20), but signi- 
ficant (p < .001) negative correlation between D and learning scores. In 
addition, the investigators noted "striking" differences in class performance 
between high and low dogmatics, with the former being characterized as 
intellectually lethargic and unwilling to relate to others. 
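
The phrase "partialled out" in the studies above refers to the first-order partial correlation. As a minimal sketch, not taken from any of the studies cited, the standard formula can be computed as follows. The three input correlations are hypothetical values chosen only to show the arithmetic, since the coefficients needed are not reported in this review, and the function name is likewise an assumption of the example.

```python
import math

def partial_corr(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant."""
    denom = math.sqrt((1.0 - r_xz ** 2) * (1.0 - r_yz ** 2))
    if denom == 0.0:
        raise ValueError("z is perfectly correlated with x or y")
    return (r_xy - r_xz * r_yz) / denom

# Hypothetical values: x = D score, y = learning score, z = aptitude score.
r_dl = -0.20   # dogmatism with learning (illustrative only)
r_da = -0.10   # dogmatism with aptitude (illustrative only)
r_la = 0.30    # learning with aptitude (illustrative only)
print(round(partial_corr(r_dl, r_da, r_la), 3))
```

With inputs of this kind, a dogmatism-learning correlation changes little when aptitude is held constant, because dogmatism and aptitude are themselves only weakly related; that is the pattern described for the sociology course findings above.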

As with the research dealing with dogmatism and thinking (synthesizing 
ability), low negative and nonsignificant relationships between dogmatism and 
learning are in evidence. And, similarly, the indications are that whether 
or not relationships will be found between learning and dogmatism will de- 
pend to a large extent upon the conditions under which the relationship 
is assessed. 



Cross-Cultural Studies 

Little has been done using the Dogmatism Scale in cross-cultural studies, 
although Rokeach (1960, pp. 409-410) has suggested that the structure- 
orientation, as compared to content-orientation, of the D-Scale increases 
the feasibility of such studies. Rokeach's (1960) data with English stu- 
dents are of little value because of the limited numbers in his samples. 
He made no tests of significance for cross-cultural differences. Peabody 
(1961) did not find a significant difference between English and American 
engineering students. Other studies are simply lacking. Certainly, any 
that are carried out must pay attention to the possible constraints, men- 
tioned in the review of the F-Scale research, on the use of tests developed 
in one culture to assess personality in another. 



Summary 



A call to summarize the vast array of research with the F- and D- 
Scales staggers the mind. In fact, the great number and variety of research 
studies using the two scales is one of the striking realizations coming from 
a review of the literature. The other striking, if unfortunate, realiza- 
tion is the plethora of samples (usually selected on the basis of expedi- 
ency) and variables used in the authoritarianism and dogmatism research. 
It is difficult not to conclude that the great number of equivocal findings 
may be due largely to uncontrolled sampling fluctuations and a lack of 
consistency in measures from study to study. One of the few attempts to 
methodically follow up previous research has been in the area of dogmatism 
and learning. But even here, inconsistencies in subject matter, samples, 
and tests have clouded the findings. 

Despite the generally disorderly nature of the research, it seems worth- 
while to summarize briefly some findings particularly relevant to the pre- 
sent project. To begin with, although both authoritarianism and dogmatism 
appear to be potentially potent variables in the study of education, there 
is little research bearing directly on the question whether education as a 
profession does attract people who are high authoritarians or more close- 
minded than those in other fields. By the same token, while both scales 
have been found to be related to expressions of attitudes toward children 
and teaching, there has been little success in attempts to relate F or D 
scores to classroom behavior. The findings that high authoritarians are 
less able to empathize with others, less child-centered, and more inclined 
to be punitive, and that high dogmatism scorers are likely to be less open 
to new stimuli and to change are suggestive. But such findings are likely 
to be of little import to teacher selection or education unless they can be 
confirmed in the classroom. 

One part of the present study has been a concern with the reactions of 
college students to a changed social studies methods course. One indication 
of their reactions was their ratings of the instructor of the course. There 
is little basis in the literature for predicting how those evaluations might 
be related to dogmatism or authoritarianism. High F scorers might, on the 
one hand, be expected to react negatively to an instructor who was a socra- 
tic, rather than a lecturer, authority figure. On the other hand, it also 
seems clear that high authoritarians tend to be less critical in their 
judgements of authority figures. There is little related evidence from the 
research with the Dogmatism Scale. However, high dogmatics might be expect- 
ed to react negatively to the ambiguity and the demands for synthesis in 
the course, resulting in a negative evaluation of the instructor. But this 
prediction is based on so little research that it cannot be made with any 
confidence. 

Measures of learning present another possible criterion of "reaction" 
to the methods course. The review of research with the F-Scale indicated 
that it would be reasonable to expect a small, but negative relationship 
between F scores and learning variables. The finding that dogmatism scores 
are negatively related to increments in critical thinking after instruction 
is counterbalanced by the confused findings in introductory sociology and 
psychology courses. Any relationship between dogmatism and learning in 
the present study would be expected to be negative and low. 

The second part of the present study has been concerned with the possi- 
bility that different samples of teachers might produce different results 
in the type of research which constitutes the first part of this project. 
The review, as already mentioned a number of times, has indicated the great 
variability in findings that can occur from one sample or subsample to another 
in studies using the F- and D-Scales. One reason that the question of sam- 
ple variation arose in the present study was that the project was to be 
carried out on a group of students who were predominantly Mormon. The 
review has indicated that the level of dogmatism or authoritarianism is 
likely to vary with samples of differing religious inclinations. This effect 
will be explored further in the presentation and discussion of data in 
Chapter V. 

The relationships of several other variables to F- and D-Scale scores 
will also be explored to answer further the question whether fluctuations 
in the authoritarianism and dogmatism of samples are important in research 
involving college students preparing to be teachers. The review of liter- 
ature indicates that generally one should expect to find a low, but nega- 
tive relationship between F-Scale scores and intelligence. With the Dogma- 
tism Scale, the relationship should be anticipated as low and negative, if 
significant at all. While it does seem fairly clear that both authoritar- 
ianism and dogmatism decrease with increases in level of education, the 
relationship between age and the scores on the two scales is not clear-cut. 
By the same token, there are conflicting findings when the scores of males 
and females have been compared. The research also provides no basis for 
expecting that elementary and secondary teachers will differ in their F- 
and D-Scale scores. However, there is a paucity of research findings bear- 
ing on the relationship between the subject fields in which prospective 
teachers will teach and their authoritarianism and dogmatism. Data from 
the present study should shed light on the relationships between each of 
these variables and F- and D-Scale scores. 

Finally, there is clear evidence that the F-Scale does pick up differ- 
ences between cultures. This has been true in comparing Americans with 
Arabs, Japanese, and Germans. Few intercultural studies are available with 
the Dogmatism Scale. That part of the present study involving the sample 
of German teacher trainees will serve to extend the intercultural evidence 
on both scales. The analysis should, based on past findings, indicate 
higher mean F- and D-Scale scores for the German sample as compared to the 
American one. 

As a postscript, it might be well to mention the consistently high 
relationships between the F- and D-Scales which have been reported in the 
literature despite the number of factor analyses that seem to indicate 
that dogmatism and authoritarianism are discriminable. The present study 
has produced intercorrelations between the two scales, as well as between 
each and the Gough-Sanford Rigidity Scale which played a central role in 
Rokeach's empirical distinction of dogmatism and rigidity. These findings 
will be reported for those interested in the question of the validity of 
the scales. 




IV: OPEN-CLOSED MINDEDNESS AND THE METHODS COURSE— THE STUDY 



As has already been noted in the first chapter, the research to be re- 
ported here is an outgrowth of curiosity about the factors related to students' 
reactions to a social studies methods course. The studies were exploratory 
and correlational in nature, in an attempt to gain some insights into the 
tendency for students to make rather strong verbal reactions to the methods 
classes taught by the senior investigator. 

In looking back through the students' evaluations for these courses, 
it has been heartening to be reminded that, almost without exception, when 
asked to indicate whether "there is something you believe the instructor has 
done especially well in his teaching of the course," the students responded 
with some variation on the theme, "He has stimulated me to think more criti- 
cally about educational issues." Yet, there is still the nagging memory 
(having moved on into the area of research, undergraduate methods courses 
are no longer a part of the senior investigator's teaching assignment) that 
many students were not happy with the course. The reasons are no doubt 
partly indicated by common responses to a query on the evaluation forms for 
"specific things you believe might be done to improve the instructor's teach- 
ing of this course." More specificity in assignments, lectures, and tests 
was often called for. The lack of specificity was a part of the attempt 
to set the stage for the emphasis on thinking and analysis which evoked 
positive comments. 

In this context, measures of dogmatism and authoritarianism seemed to 
have particular potential for an exploratory study. The variable of cog- 
nitive rigidity also held promise, although from the first, it was included 
in the study primarily to provide a check on Rokeach's distinction between 
dogmatism and rigidity. 



The Subjects and the Setting 

Three sections of the social studies methods course were involved in
the research. Group 1 (N = 26) took the course during the Spring Quarter,
1963; Group 2 (N = 33) took the course Fall Quarter, 1963; Group 3 (N = 30)
took the course Spring Quarter, 1964. To the consternation of those
instructors who like classes adorned by feminine pulchritude, few female
students major in secondary school social studies at Utah State University.
In this investigation, Group 1 had only two females, Group 2 had four, and
Group 3 had nine. Generally, the students were juniors who had not yet done
their student teaching, although an occasional senior was enrolled.



The course was a three-hour class, meeting three times a week for a
fifty-minute period. There was no assigned textbook, although journal
articles and sections from books were assigned for reading. (A book by Shaver
and Berlak, 1968, presents readings quite representative of those used in
the course.) Class meetings were typically devoted to the discussion of
issues in social studies education. Although the assigned readings were
relevant to the discussions, only infrequently did a discussion focus on the
analysis of a specific reading. Lectures were presented by the instructor
at no more than one out of four or five class meetings.
No systematic data describing the class meetings were collected,
although it was recognized that such information could be valuable in
interpreting the findings (Shaver, 1964). The exploratory nature of the study
and the limited resources simply did not make the development of an
instrument for scoring classroom behavior or the hiring and training of
observers feasible. If a description of teaching style presently in the
literature could be applied to the class discussions, however, it would
undoubtedly be that of a Socratic teacher, as compared to a recitation one
(Oliver & Shaver, 1966; Shaver, 1964). Students were prodded to take stands
on issues in social studies education and then forced to defend their stands,
with special attention paid to possible inconsistencies in their beliefs and
values.



The Data 

The F-, D-, and R-Scales were combined in one testing instrument for
this study. Form 40-45 of the F-Scale (Adorno, 1950, pp. 255-257) was used
with one modification. Item No. 22, "It is best to use some prewar
authorities in Germany to keep order and prevent chaos," seemed historically
out of context for our college samples. Moreover, of all the items, it seemed
most likely to be reacted to in a different context by the subjects in our
German samples. Consequently, that item was dropped from the scale. A
replacement was not constructed. The F-Scale means reported in this study
are therefore based on a 28-item rather than a 29-item scale. This should be
remembered in comparing our data against those from other studies. Such
comparisons should be made on the basis of group means converted to mean
scores per item. While dropping the one item has disadvantages, those of
constructing a replacement, with attendant questions of validity, seemed more
serious.
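
The conversion to mean scores per item recommended above is simple division, sketched here in Python as an illustration only. The group means are the F-Scale values reported in Table 3 and the 28-item length is taken from the text; the function name is an assumption introduced for the example.

```python
# F-Scale group means as reported in Table 3 of this report.
F_SCALE_ITEMS = 28  # the modified scale used here (one item dropped)

group_means = {"Group 1": 88.58, "Group 2": 100.03, "Group 3": 88.00}


def mean_per_item(scale_mean, n_items):
    """Convert a mean total scale score to a mean score per item."""
    if n_items <= 0:
        raise ValueError("n_items must be positive")
    return scale_mean / n_items


per_item = {group: mean_per_item(mean, F_SCALE_ITEMS)
            for group, mean in group_means.items()}

for group, value in per_item.items():
    print(f"{group}: {value:.2f} per item")
```

A per-item mean from a study using the full-length scale would be obtained the same way, dividing by that study's own item count, so that the two figures are on a common footing.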

The testing instrument, which included the 28-item F-Scale, the D-Scale
(Form E) and the R-Scale (as presented in Rokeach, 1960, pp. 418-419), was
administered during regular class periods toward the beginning of each
quarter. Administration took about 25 minutes. The reliability coefficients
for the tests (split-half, corrected with the Spearman-Brown formula) for
the three groups combined were: F-Scale, .74; D-Scale, .82; R-Scale, .81.
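
A split-half coefficient corrected with the Spearman-Brown formula can be sketched as follows. This is a minimal illustration, not the computation actually run for the study: the report does not say how the items were divided into halves, so the odd-even split is an assumption, and the response data and function names are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def spearman_brown(r_half):
    """Step a half-test correlation up to an estimate for the full-length test."""
    return 2 * r_half / (1 + r_half)


def split_half_reliability(responses):
    """responses: one list of item scores per subject.

    Assumption: halves are formed from odd- and even-numbered items.
    """
    odd_totals = [sum(row[0::2]) for row in responses]
    even_totals = [sum(row[1::2]) for row in responses]
    return spearman_brown(pearson_r(odd_totals, even_totals))


# Hypothetical responses: four subjects, four items each.
responses = [[1, 1, 2, 2], [2, 2, 3, 3], [3, 3, 4, 4], [4, 4, 5, 5]]
print(split_half_reliability(responses))
```

The correction matters because each half is only half as long as the full scale; an uncorrected half-test correlation understates the reliability of the complete instrument.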

In Groups 2 and 3, the Otis Quick-Scoring Mental Ability Tests: New
Edition, Gamma Test was administered as an estimate of mental ability that
would consume little class time.

During one of the last two class meetings, the University of Washington
Survey of Student Opinion of Teaching was administered to obtain student
evaluations of the instructor. The first part of the Survey has ten items
(see Table 2) on which each student rates the instructor. The instructor is
to be compared with other teachers the student has had, using the following
scale: 1 = outstanding, 2 = superior, 3 = competent, 4 = fair, 5 = of less
value. The numerical order of the scale was reversed in scoring the
evaluations so that correlation coefficients would be easier to interpret.
Each item was included in the analysis along with a total score which was the
sum of the items.
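
The reversal of the rating scale and the summing of items into a total score can be sketched in Python as below. The ratings shown are hypothetical and the function name is an assumption; only the five-point scale and the reversal itself come from the text.

```python
SCALE_MAX = 5  # printed scale: 1 = outstanding ... 5 = of less value


def reverse_score(raw):
    """Reverse a rating so that 5 is the most favorable and 1 the least."""
    if not 1 <= raw <= SCALE_MAX:
        raise ValueError(f"rating out of range: {raw}")
    return SCALE_MAX + 1 - raw


# Hypothetical raw ratings by one student on the ten first-part items.
raw_ratings = [1, 2, 3, 3, 2, 1, 4, 2, 1, 3]

scored = [reverse_score(r) for r in raw_ratings]
total_score = sum(scored)

print(scored)
print(total_score)
```

After reversal, a positive correlation between a Survey score and another variable means that higher values of that variable go with more favorable ratings.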

The second part of the Survey contains ten more items. The first group 
of ten items on the Survey is the outgrowth of factor analyses carried out 
at the University of Washington (Guthrie, 1954). These are now kept constant 
TABLE 2

Items from the University of Washington Survey of
Student Opinion of Teaching Used in the Study

 1. Interprets abstract ideas and theories clearly
 2. Gets me interested in his subject
 3. Has increased my skills in thinking
 4. Has helped broaden my interests
 5. Stresses important material
 6. Makes good use of examples and illustrations
 7. Has motivated me to do my best work
 8. Inspires class confidence in his knowledge of subject
 9. Has given me new viewpoints or appreciations
10. Is clear and understandable in his explanations

 C. Is willing to give individual attention
 F. Has a good sense of humor
 G. Is fair in grading
 H. Has presented many thought-provoking ideas
 I. Has given me new tools for attacking problems
 J. Shows respect for questions and opinions of students


for the University's own use of the Survey in its instructor evaluation
program. The items in the second group, however, are varied occasionally
in the hopes of turning up better discriminators. From the second group of
items six of interest were selected for inclusion in the analysis. These
items are also presented in Table 2. Again, the individual items were
summed for a total score.



Other items taken from the Survey for analysis were an estimate by the
student of the grade he would receive in the course (coded A = 4, B = 3,
C = 2, D = 1, E = 0), a grade for the instructor (A, B, C, D, or E) to be
based on his "help" to the student in his "efforts to learn this subject,"
and a three-point rating of the subject matter of the course (highly
interesting, moderately interesting, not very interesting).

For each of the three groups, the students' grades in the course were 
used as a criterion of learning. In addition, for Group 2 there were avail- 
able two other measures of learning: a gain score on a 3 : item multiple 

choice test and a gain score on the Social Issues Analysis Test No. 1
(Oliver & Shaver, 1966) aimed at determining the student's ability to analyze 
the analytic functions indicated by statements in a discussion. In retrospect 
after the completion of the course with Group 2, neither test seemed to 
reflect adequately the objectives of the course, so they were not admini- 
stered to Group 3. The project was not planned, nor was funding provided, 
for further test development. 
Results



The findings of primary interest are those having to do with the
relationships of F-, D-, and R-Scale scores to the variables reflecting
students' reactions to the methods course. However, some descriptive data on
the three groups involved in the study will provide background for the
correlational results. The means and standard deviations for the three groups
on the F-Scale, D-Scale, R-Scale, and the Otis Test are presented in Table 3.



TABLE 3

Means and Standard Deviations on the F-, D-, and
R-Scales and the Otis Test

             F-Scale a          D-Scale           R-Scale            Otis
           Mean b   S.D.     Mean c   S.D.     Mean c   S.D.     Mean d   S.D.

Group 1     88.58   17.08    149.74   23.55     85.21   14.76      ---     ---
Group 2    100.03   17.74    145.70   22.89     92.16   13.65    114.45    8.82
Group 3     88.00   20.13    143.56   21.07     87.68   13.09    119.32    7.86

a Twenty-eight item scale (see text).
b With d.f. = 2/74 and F = 3.87, the difference among the means is
  significant, P < .05 > .01.
c Difference among the means not significant at the .05 level.
d With d.f. = 1/56 and F = 4.75, the difference between the means is
  significant, P < .05 > .01.



The variability in the central tendency of scores from class to class is of 
particular interest. 

Correlations among the F-, D-, and R-Scales are presented in Table 4.
Generally there are no surprises here. The correlations between the F- and
D-Scales fall within the range reported by Rokeach (1960, p. 121) for his
studies. Rokeach (1960, p. 193) noted that in his various studies, the
correlations between the D- and R-Scales had ranged from .37 to .55. The
correlations between those two variables in this study extend beyond that
range, with the lowest only .27 and the highest .76. This suggests that
whether dogmatic and rigid thinking are independent, as proposed by Rokeach,
quite likely depends upon the particular sample tested. The moderate
correlations between F-Scale and R-Scale scores suggest a greater degree of
independence than was the case with the D- and R-Scales.







TABLE 4

Correlations Among the F-, D-, and R-Scales

                          Group 1    Group 2    Group 3

F-Scale with D-Scale        .70        .61        .73
F-Scale with R-Scale        .49        .46        .58
D-Scale with R-Scale        .27        .62        .76



The correlations between the three personality scales and the Otis
Test scores are of interest given the concern in the literature for
authoritarianism-intelligence and dogmatism-intelligence relationships.
Inspection of Table 5 indicates no finding supportive of any hypothesized
relationship between intelligence and either authoritarianism or dogmatism.
With the small samples involved in this study, none of the correlation
coefficients differs significantly from zero. It is worth noting, however,
that in every case the sign is negative. This supports the conclusion in the
review of literature that if any relationships exist, they are low and
negative.
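
How a small correlation from a small sample fares against zero can be illustrated with a short Python sketch. The report does not state which significance test was used, so the Fisher z approximation below is a swapped-in technique chosen because it needs only the standard library. The coefficient -.23 appears in Table 5; the sample size of 30 is an assumption, since the number of students with Otis scores in each group is not given.

```python
from math import atanh, sqrt
from statistics import NormalDist


def fisher_z_p_value(r, n):
    """Two-sided p-value for H0: rho = 0, using the Fisher z approximation."""
    if n <= 3:
        raise ValueError("n must exceed 3 for the Fisher z approximation")
    z = atanh(r) * sqrt(n - 3)
    return 2 * (1 - NormalDist().cdf(abs(z)))


# r from Table 5; n = 30 is an assumed sample size for illustration.
p_value = fisher_z_p_value(-0.23, 30)
print(f"p = {p_value:.3f}")
```

Under these assumptions the p-value falls well above .05, which is in line with the statement that none of the coefficients differs significantly from zero.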



TABLE 5

Correlations Between Otis I.Q. Scores
and F-, D-, and R-Scales

            F-Scale    D-Scale    R-Scale

Group 2      -.17       -.19       -.12
Group 3      -.23       -.01       -.04
Means and standard deviations for the students' evaluations of the
instructor on the University of Washington Survey of Student Opinion of 
Teaching are contained in Table 6. It will be recalled that the individual 
items used a five point rating scale, scored so that the highest rating 
received a value of five and the lowest a value of one. The rating of inter- 
est in the subject matter of the course was on a three point scale. 

Generally, the mean ratings are at or above the median scale value. 

Means and standard deviations do not provide information for testing the 
instructor's impression that students tended to have rather strong positive 
or negative reactions to the courses. If his perception was correct, the 
distributions of scores on the Survey could be expected to be bimodal. 
Inspection of Table 7 does not reveal bimodal distributions for the two 
total scores obtained by summing ratings for the two groups of Survey items. 
Yet, the distributions do tend to be platykurtic, with extreme scores extend- 
ing the distributions. While the presence of strikingly bimodal distribu- 
tions would have been much firmer confirmation of the instructor's impressions, 
the shapes of the distributions do lend support for his conjecture that 
reaction to the courses was diverse. 
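
The notion of a platykurtic distribution can be made concrete with a short Python sketch. The scores below are hypothetical, not the Table 7 data, and the function name is an assumption; the sketch only shows that a flat, spread-out set of totals yields negative excess kurtosis while a sharply peaked set yields a positive value.

```python
def excess_kurtosis(scores):
    """Population excess kurtosis: fourth moment / variance squared, minus 3."""
    n = len(scores)
    mean = sum(scores) / n
    m2 = sum((x - mean) ** 2 for x in scores) / n
    m4 = sum((x - mean) ** 4 for x in scores) / n
    return m4 / m2 ** 2 - 3


# Hypothetical rating totals spread evenly from 20 to 45 (flat distribution).
flat_scores = list(range(20, 46))

# Hypothetical totals bunched at one value with two outliers (peaked).
peaked_scores = [30] * 20 + [20, 40]

print(excess_kurtosis(flat_scores))    # negative: platykurtic
print(excess_kurtosis(peaked_scores))  # positive: leptokurtic
```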

It is interesting to note that there is a rough correspondence between 
fluctuations in the means for the rating scale totals and the grade assigned
the instructor and the decrease in D-Scale means (Table 3) from Group 1 to 
3. This rough association must be interpreted carefully, of course. A 
similarly negative association between mean course grades (Table 8) and dog- 
matism means is noticeable, and there is no way of telling whether any 
trend toward higher ratings of the instructor is due to a decrease in dog- 
matism or to a possibly more favorable classroom climate when a group of 
students is going to receive higher grades. 

Intercorrelations among the various scores from the University of Wash- 
ington Student Opinion Survey are presented in Table 9. Item intercorrela- 
tions for both parts of the Survey are generally moderate and positive, and 
frequently significant. This indicates that there is considerable indepen- 
dence among the items. 

The relatively high correlations of individual items with the total 
score for the first part of the Survey are encouraging; the item-total corre- 
lations are not so consistently high for the second part of the survey, 
raising some question about the validity of that total score. The low corre- 
lations between the total scores for the two parts are surprising (.39, .42, 
and .19), but they do justify not pooling the total scores as one variable. 

A reflection of the low relationship between the two total scores is the 
relatively high correlations of the first part total with the grade given 
the instructor, as contrasted with the low correlations for the total of 
the second group of items. 

Some indication of the validity of student responses can be had by
looking at the items which correlate with the students' indication of
interest in the subject matter. Item 2, "Gets me interested in his subject,"
was significantly correlated with the subject matter interest score in
every class. Other items producing at least two significant correlations
had to do with the instructor's success in broadening interests and
developing new viewpoints and appreciations. The correlation of the total







TABLE 6

Means and Standard Deviations for the
University of Washington Survey Scores

                                         Group 1         Group 2         Group 3
                                       Mean    S.D.    Mean    S.D.    Mean    S.D.

Item-1  (Interprets ideas)              3.42    .84     3.60    .86     3.88   1.01
Item-2  (Gets me interested)            3.16   1.01     2.91   1.07     3.60    .86
Item-3  (Skills in thinking)            3.68    .82     3.64    .99     3.84    .75
Item-4  (Broaden interests)             3.17    .71     3.24   1.12     3.48   1.19
Item-5  (Important material)            3.42    .61     2.88    .78     3.52    .96
Item-6  (Examples and illustrations)    3.63   1.01     3.15   1.00     3.88    .93
Item-7  (Motivated me)                  2.68    .58     2.61   1.02     2.92   1.04
Item-8  (Confidence in knowledge)       3.32    .94     3.54   1.06     3.75    .94
Item-9  (New viewpoints)                3.68    .88     3.82    .95     3.92    .90
Item-10 (Clear explanations)            3.21    .79     3.00    .97     3.42    .97

Total                                  33.05   4.58    32.39   6.65    36.44   7.19

Item-C  (Individual attention)          3.58    .96     3.58   1.00     3.96   1.17
Item-F  (Sense of humor)                3.68    .67     3.48   1.00     4.04    .84
Item-G  (Fair in grading)               2.68    .94     2.87    .76     2.96   1.16
Item-H  (Thought-provoking)             3.63    .90     3.52   1.15     4.08   1.00
Item-I  (New tools)                     3.32    .82     3.42    .97     4.00    .95
Item-J  (Respect for students)          3.32   1.00     3.54   1.03     3.88    .83

Total a                                20.21   3.01    20.24   4.45    25.21  10.66

Instruc. Grade                          3.94    .72     3.91    .73     4.04   1.12
Subject Matter                          2.16    .60     2.16    .73     2.43    .66
Expected Grade                          3.50    .51     3.63    .67     3.57    .60

a With d.f. = 2/73 and F = 4.35, the difference among the means is
  significant, P < .05 > .01.



TABLE 7

Distribution of Ratings of Instructor on University of
Washington Survey--Totals for Two Groups of Items

[Frequency table not recoverable from the source. Rows: total ratings from
55 down to 12. Columns: frequencies for G-1, G-2, G-3, and the groups
combined, given separately for Survey Total 1 and Survey Total 2.]

* G-1 stands for Group 1, G-2 for Group 2, etc.
TABLE 8

Means and Standard Deviations for Course Grades,
Multiple Choice Gain Scores, and
SIAT No. 1 Gain Scores

                            Group 1         Group 2         Group 3
                          Mean    S.D.    Mean    S.D.    Mean    S.D.

Course Grades              3.11    .57     3.15    .87     3.40    .86
Multiple Choice Gain        ---    ---     5.00   1.98      ---    ---
SIAT No. 1 Gain             ---    ---     5.75   2.46      ---    ---


score for the first part of the Survey with interest in subject matter also
tends to corroborate the validity of that score as compared to the total
score for the second part. Of course, the second part of the Survey
contains fewer items and their content is different from those in the first
part. Most plausibly, the differing results with the two total scores
reflect the use of factor analysis as the basis for selecting the first
set of items. It will be recalled that these come from a factor analytic
study at the University of Washington, while the items in the second set
are more experimental in nature.

What of the relationships between F, D, and R scores and student
reactions to the course? Table 10 presents the correlations between F, D, R,
and Otis scores and the items on the Student Survey. There are few
correlations of even a moderate magnitude, and few of these are significant.
The total score for the first ten items (which appears to be the more
reliable and valid of the two total scores) is not correlated at the .05
level of significance with any of the four tests. In fact, not only do the
correlations fail to depart significantly from zero, but their numerical
values are extremely close to zero.

There are three significant correlations for the F-Scale. Two of these,
for Groups 1 and 3, are with Item C, "Is willing to give individual
attention." These few correlations hardly give credence to the notion that
there is some consistent relationship between the students' F-Scale scores
and how they will evaluate an instructor, for one coefficient is significant
and negative, and the other is significant and positive. The significant
negative correlation between the total score for the second part of the
Survey and the F-Scale reflects the correlation with Item C and the generally
negative, even though small, correlations between the F-Scale and the
individual items. This consistency is in contrast with the correlations
between F-Scale scores and items on the first part of the Survey for the
same group.

Dogmatism scores correlated significantly with only one item: Item 3,
"Has increased my skills in thinking." Chance is as good an explanation as
any for this one correlation.
Coefficients listed in descending order: Group 1 first, Group 2 second, Group 3 third.
Significant at the .05 level 
Significant at the .01 level 
TABLE 10

Correlations Among Otis I.Q., F-, D-, and R-Scale and
University of Washington Survey Scores

                                 Group    I.Q.      F       D       R

Item-1                             1       ---     .18     .16     .27
(Interprets Ideas)                 2      -.03     .18     .23     .15
                                   3      -.14     .06     .26     .28

Item-2                             1       ---     .07     .05     .0?
(Gets Me Interested)               2       .11    -.13     .51b    .11
                                   3      -.14    -.11    -.20    -.21

Item-3                             1       ---    -.03     .00     .16
(Skills in Thinking)               2       .11     .03     .34a    .27
                                   3        ?     -.08     .07    -.04

Item-4                             1       ---     .26     .45     .51a
(Broaden Interests)                2      -.13     .09     .29     .33
                                   3       .20    -.15     .05     .03

Item-5                             1       ---     .11     .08     .19
(Important Material)               2       .07    -.02     .15     .05
                                   3      -.20    -.01     .15     .08

Item-6                             1       ---     .19     .06    -.12
(Examples and Illustrations)       2       .02     .08     .19     .11
                                   3       .06     .19     .27     .25

Item-7                             1       ---     .14     .06     .11
(Motivated Me)                     2      -.28    -.06     .10     .08
                                   3       .11    -.03     .31     .22

Item-8                             1       ---     .10     .00    -.26
(Confidence in Knowledge)          2       .15    -.16     .10    -.03
                                   3      -.42a    .12     .18     .36

Item-9                             1       ---     .06     .23     .12
(New Viewpoints)                   2       .01    -.14     .28     .10
                                   3       .22    -.31    -.04    -.06

Item-10                            1       ---     .15    -.18    -.01
(Clear Explanations)               2       .20    -.05     .17     .05
                                   3       .06    -.24    -.05    -.09

Total                              1       ---     .10     .20     .09
                                   2       .03    -.04     .28     .19
                                   3       .01    -.18     .10     .01

Item-C                             1       ---    -.61b   -.32    -.15
(Individual Attention)             2      -.09    -.08     .08    -.16
                                   3      -.01     .40a    .46     .35

Item-F                             1       ---    -.36     .05    -.51a
(Sense of Humor)                   2       .15    -.16    -.13    -.02
                                   3       .04     .11     .30     .25

Item-G                             1       ---    -.20    -.37    -.06
(Fair in Grading)                  2      -.24    -.33    -.21    -.33
                                   3       .31    -.14     .16     .008

Item-H                             1       ---    -.32    -.34    -.28
(Thought-provoking)                2      -.22    -.06     .17     .13
                                   3       .16     .09     .22    -.02

Item-I                             1       ---    -.24    -.15    -.30
(New Tools)                        2      -.20     .03     .11    -.01
                                   3       .19    -.04     .15    -.02

Item-J                             1       ---    -.14     .03     .13
(Respect for Students)             2       .05    -.14     .00    -.16
                                   3       .13    -.11     .18     .08

Total                              1       ---    -.55a   -.34    -.30
                                   2      -.?4    -.12     .01    -.11
                                   3      -.04     .27     .16     .08

Inst. Grade                        1       ---    -.21    -.08     .29
                                   2      -.10    -.10     .12     .04
                                   3       .09     .13     .00     .08

Subj. Matter                       1       ---     .18     .21     .40
                                   2      -.17    -.27     .10     .18
                                   3      -.30     .19     .11    -.06

Expect. Grade                      1       ---     .35     .27    -.09
                                   2       .02    -.36     .13     .06
                                   3       .42a   -.18     .04    -.11

a Significant at .05 level
b Significant at .01 level
? Entry or digit illegible in the source
Gough-Sanford Rigidity scores were involved in two significant
correlations, identical in numerical value and opposite in sign. The positive
correlation is with Item 4, "Has helped broaden my interests," and the
negative correlation with Item F, "Has a good sense of humor." In neither
instance is there much sign of a confirming correlation with one of the other
two groups, and there is no particular reason why these two, among all the
items, should be correlated with cognitive rigidity.



In short, the correlations between F, D, and R scores and the Student 
Survey scores lend little support to the idea that authoritarianism, dogma- 
tism, or rigidity are related to students' evaluations of an instructor in 
a revamped, inquiry-oriented social studies methods course. 



The same must be said about relations with learning. If one accepts 
variability in course grades as a criterion of learning, the coefficients 
reported in Table 11 reflect the relationships one might expect between 
measures of learning and Otis I.Q. scores (with the exception of the Mul- 
tiple Choice Gain Score, which is of doubtful validity anyway). There 
are, however, no significant correlations (not even any sizeable ones) 
between the F-, D-, and R-Scale scores and the three learning criteria. 



TABLE 11

Correlations Among Course Grades, Multiple Choice Gain,
SIAT No. 1 Gain, Otis I.Q.,
F-, D-, and R-Scale Scores

                         Group    I.Q.    F-Scale   D-Scale   R-Scale

Course Grades              1       ---     -.01       .08       .10
                           2       .48b    -.26      -.03      -.04
                           3       .49a    -.03       .03       .14

Multiple Choice Gain       2      -.03     -.17      -.24        ?

SIAT No. 1 Gain            2       .40a    -.17      -.04       .02

a Significant at the .05 level.
b Significant at the .01 level.
? Entry illegible in the source.



V: OPEN-CLOSED MINDEDNESS IN TEACHER EDUCATION STUDENTS--THE STUDY 



Given the tendency for educational and psychological researchers to
conduct their investigations on conveniently available samples, it seems
important to develop some understanding of the relationships between other
sample characteristics and personality traits such as dogmatism and
authoritarianism. This study has been concerned with the relationships, in
samples of teacher education students, between dogmatism, authoritarianism,
and rigidity and sample characteristics such as religious affiliation, choice
of elementary or secondary level for teaching, teaching major, sex, and age.
The intent has been to provide researchers with some insights into possible
sources of bias in the dogmatism, authoritarianism, and rigidity scores of
their samples.



The Subjects 

In an attempt to obtain a nationwide sampling of students in teacher 
education, a number of colleagues (listed in the Acknowledgements) were con- 
tacted at different institutions and asked to help gather data. In only one 
instance, involving a Catholic college for women, was the request refused. 

It seemed unrealistic to ask the cooperating colleagues to obtain random
samples of the teacher education student bodies at their respective
institutions, so instructions were set up to help ensure representative
sampling. In most universities students take their professional education
courses during their junior and senior years, and presence in these classes
is good evidence of an intent to teach. Therefore, the cooperating colleagues
were requested to obtain samples from junior and senior year education
courses. They were also asked to test in as many classes as feasible. The
results, as must always be the case when one relies on busy professors to do
one's work, were not always completely as hoped for.

Because he was on the faculty at Utah State University at the initiation
of the study, the senior investigator was able to make his own contacts for
testing in the general methods and curriculum courses offered at that
institution. Then, during a year's tenure on the faculty of The Ohio State
University, he was able to arrange for testing there. Consequently, those two
samples were the largest (U.S.U. undergraduates, N = 390; O.S.U., N = 317).
During a summer's professorship at the University of Washington, the senior
investigator also obtained data from the students (N = 34) in the classes
he taught there. Also, he was asked to provide the combined F-, D-, and
R-Scale instrument in two instances (one for an unpublished predictive study
with graduate students at Utah State University, the other for assessment
purposes with a group of 57 Peace Corps volunteers undergoing training at
U.S.U.) and, in turn, requested the data from the test administrations. In
addition, samples were obtained from Boston University (N = 122), the
University of Oklahoma (N = 118), the University of Michigan (N = 85), the
University of California at Santa Barbara (N = 66), and the Harvard Graduate
School of Education (N = 108).

Some descriptive data on the samples are presented in Tables 12 through
16. Numbers from the different tables will not always match, because not all
information was available for every student. Inspection of these tables
reveals some of the
weaknesses in the samples. Although one would expect a sampling of teacher 
education students to contain more women than men, the ratio is particularly 
out of balance in the Boston University, University of California, and the 
University of Michigan samples (Table 12). The reason for these results can 



TABLE 12

Sex of American Subjects

University                 Male    Female

Boston                        7      115
California                   10       56
Harvard                      29       79
Michigan                     15       70
Oklahoma                     24       91
Utah State                  148      243
Utah State--Grad. Ss.        78       31
Washington                   20       13
Ohio State                  103      213
Peace Corps                  40       17

Total                       474      928



be seen in Table 13. In each instance, the sample is either entirely or 
heavily from elementary education courses, and elementary education tra- 
ditionally attracts an even greater proportion of women than does secondary 
education. 
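
The imbalance noted for the Boston, California, and Michigan samples can be checked directly from the Table 12 counts. The Python sketch below is only an illustration; the dictionary layout and function name are assumptions, while the counts are those reported in Table 12.

```python
# (male, female) counts as reported in Table 12.
sex_counts = {
    "Boston": (7, 115),
    "California": (10, 56),
    "Harvard": (29, 79),
    "Michigan": (15, 70),
    "Oklahoma": (24, 91),
    "Utah State": (148, 243),
    "Utah State--Grad. Ss.": (78, 31),
    "Washington": (20, 13),
    "Ohio State": (103, 213),
    "Peace Corps": (40, 17),
}


def proportion_female(male, female):
    """Share of women among the subjects whose sex was recorded."""
    return female / (male + female)


proportions = {name: proportion_female(m, f) for name, (m, f) in sex_counts.items()}

# Samples ordered from the highest to the lowest proportion of women.
ranked = sorted(proportions, key=proportions.get, reverse=True)
for name in ranked:
    print(f"{name:24s} {proportions[name]:.2f}")
```

Ordering the samples this way puts the three samples named in the text at the top of the list.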

Table 14 reveals another discrepancy between intent and sample. The 
University of Michigan sample is made up primarily of persons who had com- 
pleted their bachelor's work, had some teaching experience, and had come back 
for more course work in extension classes. The sample of U.S.U. graduate 
students and those from the University of Washington had not been contem- 
plated in the original design of the study so are "added bonuses" that need 
to be kept in mind for their discrepant nature. The Harvard sample is a 
graduate one because Harvard's teacher education program enrolls practically 
TABLE 13

Number of American Subjects in
Elementary or Secondary Education

University                 Elementary    Secondary

Boston                        118           ---
California                     47            18
Harvard                        34            74
Michigan                       62            19
Oklahoma                       27            88
Utah State                    151           233
Utah State--Grad. Ss.           2            27
Ohio State                    109           182



TABLE 14

Year in College of American Subjects

University                 Soph.     Jr.     Sr.    Grad.

Boston                      ---      107      14     ---
California                  ---       48      11       7
Harvard                     ---      ---     ---     109
Michigan                    ---      ---       3      82
Oklahoma                    ---       28      80       1
Utah State                    6       52     307      22
Utah State--Grad. Ss.       ---      ---     ---     109
Washington                  ---      ---     ---      34
Ohio State                  ---       15     287      13
no one but college graduates with liberal arts degrees and no teaching
experience, seeking the Master of Arts in Teaching. The negligible
correlations of the F-, D-, and R-Scales with age and year in college
(Table 17) suggest that the occasional use of graduate students in the
analyses which follow should not have had an effect on the results.

Table 15 indicates that there was also considerable variation from uni- 
versity to university in the teaching majors of the students tested. This 
variability will need to be kept in mind in interpreting analyses of differ- 
ences among the subjects with different majors. As the data in Table 16 
indicate, there is considerable variability in the religious groups present 
in the samples from different universities. It would be extremely easy to 
confound subject matter and religious differences in analyses when all or 
most of the subjects in one teaching major came from a university, such as 
Utah State University, where one religion dominates the sample. 

One other possible deviation in sampling should be noted. While all 
other tests were administered during regular class periods, those for the 
Harvard elementary education students were handed out to be completed at 
home and returned to the instructor on a voluntary basis. Volunteers might 
differ from nonvolunteers in their scores on the three personality scales 
used in this study. However, analyses of the data to be presented later in 
this report revealed no significant differences in mean F, D, or R scores 
for the Harvard elementary education volunteers and the Harvard secondary 
education group, who were nonvolunteers. 



The Data 

Like the data reported in the previous chapter, those for this part of
the study were collected using an instrument containing combined F-, D-,
and R-Scales. The same forms of the three scales, including the 28-item
F-Scale, were used. Answer sheets included spaces for the subjects to fill
in their sex, religion, teaching major, and age, as well as places to check
year in school and whether they were in elementary or secondary education.

Reliability coefficients (split-half, corrected with the Spearman-Brown
formula) for the three scales for the total sample (N = 1297), excluding
the Utah State University graduate students, were: F-Scale, r = .55;
D-Scale, r = .90; R-Scale, r = .79.
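
The split-half procedure with the Spearman-Brown correction can be sketched as follows. This is a minimal illustration, not the computation actually run for the study: the odd-even split of items and the function names are assumptions, since the report does not say how the items were divided.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def spearman_brown(r_half):
    """Step a half-test correlation up to an estimate for the full-length test."""
    return 2 * r_half / (1 + r_half)


def implied_half_correlation(r_full):
    """Invert the correction: the half-test correlation behind a corrected value."""
    return r_full / (2 - r_full)


def split_half_reliability(item_scores):
    """item_scores holds one list of item scores per subject.

    Assumption: items are split odd/even by position.
    """
    odd_totals = [sum(row[0::2]) for row in item_scores]
    even_totals = [sum(row[1::2]) for row in item_scores]
    return spearman_brown(pearson_r(odd_totals, even_totals))


# Uncorrected half-test correlations implied by the reported coefficients.
for scale, r in [("F", 0.55), ("D", 0.90), ("R", 0.79)]:
    print(scale, round(implied_half_correlation(r), 2))
```

Because the corrected coefficient depends entirely on how the two halves correlate, a single unlucky division of the 28 F-Scale items can pull the estimate down considerably.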

The low reliability coefficient for the F-Scale as compared to the two
other scales is perplexing because, typically, higher coefficients have been
reported in the literature (e.g., Kerlinger & Rokeach, 1966, r = .82;
Lambert, 1960, r = .88; Rokeach & Fruchter, 1956, r = .84; Rowan, 1968,
r = .86 over an 8-year lapse in time). Although the coefficient for the
F-Scale was also lower for the U.S.U. methods course study reported in the
previous chapter, it was at least "respectable" (r = .74). Even with the
German sample, the reliability coefficient was higher (r = .76), although
again it was the lowest of the three. Inspection of the data and rerunning
the correlations have provided no reason to discount the low reliability
coefficient.

There is, however, some evidence that the low reliability estimate for 
the F-Scale may be due to a chance poor split of items for the split-half 

TABLE 15



Subject Majors of American Subjects 



Teaching 

Major 


Bost.* Cal . 


Harv. 


Mich. 


Okla. 


USU 


USU 

Grad.* Wash. 


Ohio 


Total 


Art 




1 




5 


18 


2 


1 


27 


Business Educ. 






1 


11 


29 


1 


4 


46 


English




36 


5 


5 


45 


7 


33 


131 


Math 






1 


10 


17 


1 


14 


43 


Music 






2 




7 




22 


31 


Physical Educ. 






2 




40 




13 


55 


Science 




9 




11 


30 


2 


35 


87 


Social Studies 


15 


30 


5 


13 


37 


17 


49 


166 


Library 






1 




1 






2 


Home Econ. 






1 


10 


8 


1 


11 


31 


Core 












1 




1 


Psychology 










6 






6 


Languages 








14 


12 




7 


33 


Speech 








7 


15 




7 


29 


Counseling 














2 


2 


Vocational Educ. 










5 




1 


6 


Tech. & Indust. 
Educ. 








1 


1 




1 


3 


Agricultural
Educ.










1 






1 


Special Educ. & 
Remed. Reading 






5 


1 






10 


16 


Miscellaneous 








2 






5


7 



* Boston University elementary students did not indicate teaching major, and 
teaching major data was not available for Utah State University graduate students. 





TABLE 16

Religion of American Subjects

(Table body not legible in the source scan.)

computations for this study. Cronbach (1967) has argued that the reliability
coefficient obtained by the Kuder-Richardson coefficient of equivalence
(Kuder & Richardson, 1937) is the mean of all of the splits of a test that
might be made in computing the split-half coefficient. This mean is an
average that is based on poor as well as good splits, with obtained
split-half coefficients falling both below and above the mean. A coefficient
of equivalence was computed for the F-Scale using the Kuder-Richardson
Formula 21 and treating the total possible points on the test (7 x 28 = 196)
as n, the number of items (assuming n to mean total possible points). The
coefficient obtained was .90. (The coefficients of equivalence for the D- and
R-Scales were .88 and .86, respectively.) In addition, the magnitude of the
correlation (r = .70) between the F-Scale and the D-Scale (Table 17) suggests
that the true reliability of the F-Scale is higher than the split-half
estimate of .55.
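
The Formula 21 computation described above can be sketched as follows. The mean and standard deviation used here are assumptions for illustration: they are the pooled university figures for the F-Scale from Table 24 (N = 1348), not the statistics for the N = 1297 reliability sample, which this chapter does not report. The function name is also hypothetical.

```python
def kr21(n_items, mean, sd):
    """Kuder-Richardson Formula 21.

    r = (n / (n - 1)) * (1 - M * (n - M) / (n * s^2))
    where n is the number of items, M the test mean, s the test S.D.
    """
    variance = sd ** 2
    return (n_items / (n_items - 1)) * (
        1 - mean * (n_items - mean) / (n_items * variance)
    )


# Following the text, n is taken as total possible points (7 x 28 = 196).
# Assumed inputs: pooled university F-Scale mean and S.D. from Table 24.
f_scale_estimate = kr21(n_items=7 * 28, mean=88.33, sd=21.22)
print(round(f_scale_estimate, 3))
```

With these assumed inputs the estimate falls close to the reported .90, which suggests that the reported value is at least consistent with summary statistics of this size.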



Results 

The intercorrelations among the three scales, year in college, age, and
sex (scored as a two-point dummy variable with male = 1, female = 2) are
presented in Table 17. As with the correlations for the social studies
methods course samples reported in Chapter IV, there are no surprises. The
correlation between the F- and D-Scales is high, but within the range
reported in previous studies. The correlation between D and R scores is
within the range of those reported by Rokeach (1960, p. 193). The
correlations between age and the F-, D-, and R-Scales are negligible. The
same is true for sex. The coefficients for sex approach significance for the
F- and D-Scales because of the large number of subjects involved (N = 1403
for these correlations). Although the negative relationships indicate that
males had higher scores on the scales, the correlation is so slight as to be
meaningless for explaining variability in either authoritarianism or
dogmatism.

Year in college is significantly correlated with F- and D-Scale scores.
Again, however, the coefficients are so small (-.15 and -.13) as to explain
little variability, although they do confirm the expectation from the review
of literature that dogmatism and authoritarianism decline with education.
The correlations between F and D scores and year in college were small
enough to justify pooling undergraduate and graduate samples for some of the
analyses reported on the following pages.
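
The contrast between statistical significance and explanatory power can be made concrete with a short sketch. It assumes N = 1403 for the year-in-college correlations, the figure the text gives for the sex correlations; the function names are hypothetical.

```python
from math import sqrt


def variance_explained(r):
    """Proportion of variance shared by two variables correlated at r."""
    return r * r


def t_for_correlation(r, n):
    """t statistic for testing r against zero, with n - 2 degrees of freedom."""
    return r * sqrt((n - 2) / (1 - r * r))


N = 1403  # assumed sample size for these correlations
for scale, r in [("F-Scale", -0.15), ("D-Scale", -0.13)]:
    share = 100 * variance_explained(r)
    t = t_for_correlation(r, N)
    print(f"{scale}: r = {r:+.2f}, variance explained = {share:.2f}%, t = {t:.2f}")
```

With a sample this large the t statistics lie well beyond conventional critical values, yet each correlation accounts for only about two percent of the variance.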

As would be expected, year in college and age show a high correlation.
The coefficient was probably not higher than .45 because it is common
practice for Mormon males to go on missions for the church either before
entering college or after one or two years of college. This means that
undergraduates at Utah State University tend to be older than is typical,
and the U.S.U. sample constituted a large proportion of the total sample.
The negative correlations between year in college and sex and age might also
be mentioned. One would expect to find more males in the later years of
college, especially among graduate students. The negative correlation
between sex and age undoubtedly reflects the tendency for more males to take
graduate courses, as well as the missionary effect (few females go on
missions) at Utah State University.

TABLE 17

Correlations Among F-, D-, and R-Scale Scores,
Age, Year in College, and Sex
for the American Sample

                            1       2       3       4       5

1. Year in college
2. Age                     .45
3. Sex                    -.21    -.20
4. F-Scale                -.15     .01    -.06
5. D-Scale                -.13    -.04    -.07     .70
6. R-Scale                -.05     .04    -.04     .53     .47



Aside from the correlations, several analyses were carried out. Perhaps
the best question from which to start reporting these is, "Were there
significant differences among the different university samples on the three
scales: authoritarianism, dogmatism, and rigidity?" Tables 18 through 23
present the results of analyses of variance to test the significance of mean
differences among the university and Peace Corps groups on the three scales,
and of the use of Duncan's New Multiple Range Test to test the significance
of the differences between pairs of means when the difference among means
was significant at the .05 level.
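
A one-way analysis of variance of this kind can be reproduced approximately from group sizes, means, and standard deviations alone. The sketch below uses the F-Scale figures from Table 18 and assumes that the reported standard deviations were computed with n - 1 in the denominator; the function name is hypothetical, and this is not the program used for the study.

```python
def anova_from_summary(groups):
    """One-way ANOVA from (n, mean, sd) triples.

    Returns (F, df_between, df_within). Assumes each sd is the
    sample standard deviation (n - 1 denominator).
    """
    total_n = sum(n for n, _, _ in groups)
    grand_mean = sum(n * mean for n, mean, _ in groups) / total_n
    ss_between = sum(n * (mean - grand_mean) ** 2 for n, mean, _ in groups)
    ss_within = sum((n - 1) * sd ** 2 for n, _, sd in groups)
    df_between = len(groups) - 1
    df_within = total_n - len(groups)
    f_ratio = (ss_between / df_between) / (ss_within / df_within)
    return f_ratio, df_between, df_within


# (N, test mean, S.D.) for the F-Scale, taken from Table 18.
table_18 = [
    (390, 96.99, 19.27),  # Utah State (graduate Ss. excluded)
    (122, 92.99, 19.63),  # Boston
    (118, 90.58, 20.95),  # Oklahoma
    (85, 83.74, 20.95),   # Michigan
    (317, 83.40, 19.65),  # Ohio State
    (66, 80.04, 16.57),   # California
    (34, 79.18, 18.04),   # Washington
    (57, 78.72, 19.15),   # Peace Corps
    (108, 69.27, 18.12),  # Harvard
]

f_ratio, df_b, df_w = anova_from_summary(table_18)
print(f"F = {f_ratio:.2f} with d.f. = {df_b}/{df_w}")
```

The result lands near the reported F-Ratio of 30.56 with d.f. = 8/1288; the small discrepancy is what one would expect from working with rounded means and standard deviations. Applied to the two F-Scale groups in Table 24, the same function gives about 11.3 against the reported 11.29.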

In all three analyses of variance (Tables 18, 19, and 20), the F-Ratio
is significant beyond the .01 level, with the magnitude of the F-Ratio
decreasing from the F-Scale to the D-Scale to the R-Scale. Table 21
indicates that the difference in F-Scale means for the highest sample (Utah
State University, graduate students excluded) and the lowest (Harvard) was
27.73. The Utah State University mean was significantly higher (at the .05
level) than the mean for each of the other samples. Basically the same
pattern held true for the dogmatism means (Table 22), with a difference
between the U.S.U. and the Harvard means of 20.21. However, the U.S.U. mean
is not significantly different from those of the two groups closest to it,
Boston University and the University of Oklahoma. It is worth noting, too,
that although the Harvard mean on the F-Scale was significantly lower than
any other sample mean, there are three means on the D-Scale that do not
differ significantly from it (University of California, University of
Washington, and the Peace Corps).

The order of means for the R-Scale (Table 23) is quite different than 
for the other two scales, with the U.S.U. mean fourth in magnitude. This 
change in pattern is in line with the lower correlations between the R-Scale 

TABLE 18

Analysis of Variance for F-Scale Means of American Subjects
Grouped by University and Peace Corps

                                        Test      Item
University                      N       Mean      Mean      S.D.    F-Ratio*

1. Utah State
   (Graduate Ss. excluded)     390     96.99      3.46     19.27
2. Boston                      122     92.99      3.32     19.63
3. Oklahoma                    118     90.58      3.23     20.95
4. Michigan                     85     83.74      2.99     20.95
5. Ohio State                  317     83.40      2.98     19.65
6. California                   66     80.04      2.86     16.57
7. Washington                   34     79.18      2.83     18.04
8. Peace Corps                  57     78.72      2.81     19.15
9. Harvard                     108     69.27      2.47     18.12    30.56**

*  d.f. = 8/1288
** For d.f. = 8/1000, F.05 = 1.95, F.01 = 2.53



and the F- and D-Scales as compared to that between the F- and D-Scales. 

As might be expected with the lower F-Ratio, the magnitude of the difference 
between the two extreme means (University of Oklahoma and Harvard) is smaller 
(13.34) than was the case with the F- and D-Scales, and there are fewer sig- 
nificant differences between pairs of means. 

All in all, the analyses provide rather striking evidence of the fluc- 
tuations in scores on the three scales that can be expected in drawing samples 
from different universities. One caveat needs to be entered, however. The 
analysis of variance model assumes homogeneity among the group variances. 
Although Box (1953) has demonstrated that the analysis of variance with its 
F-Ratio is a robust test, its insensitivity to variance heterogeneity appears 
to decrease as the group sizes depart from equality. 

Many of the variances reported in the following tables are strikingly 
different, to the point that no test of significance is needed to confirm 

TABLE 19

Analysis of Variance for D-Scale Means of American Subjects
Grouped by University and Peace Corps

University                      N       Mean      S.D.    F-Ratio*

1. Utah State
   (Graduate Ss. excluded)     390    146.71     23.18
2. Boston                      122    142.63     23.44
3. Oklahoma                    118    142.04     23.86
4. Ohio State                  317    140.62     22.77
5. California                   66    133.88     21.04
6. Michigan                     85    133.87     24.16
7. Washington                   34    131.53     23.55
8. Peace Corps                  57    129.44     22.22
9. Harvard                     108    126.50     23.69    12.32**

*  d.f. = 8/1288
** For d.f. = 8/1000, F.05 = 1.95, F.01 = 2.53







the difference. Even in Table 18, where the standard deviations are, by
inspection, quite similar (ranging from 16.57 to 20.95), there appears
to be statistically significant heterogeneity, although this, too, is
difficult to check with unequal observations per group. Following the
procedure recommended by Hartley (Winer, 1962, pp. 93-94), an F max of 1.60
was obtained for the Table 18 variances with d.f. = 9/389. An F max of 1.00
is significant at both the .05 and .01 levels with d.f. = 9/∞. Cochran's
C (Winer, 1962, pp. 94-95) was also computed, and C = .13 with d.f. = 9/389.
With d.f. = 9/144 (the closest value in the table, Winer, 1962, p. 654),
C must reach .14 to be significant at the .05 level and .15 to be
significant at the .01 level. The hypothesis of homogeneity of variances
hardly seems tenable for the groups in Table 18.
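
Both homogeneity statistics follow directly from the standard deviations in Table 18. The sketch below recomputes them; the function names are hypothetical, and the critical values must still be read from published tables such as those in Winer (1962).

```python
def hartley_f_max(sds):
    """Hartley's F max: largest group variance over smallest group variance."""
    variances = [sd ** 2 for sd in sds]
    return max(variances) / min(variances)


def cochran_c(sds):
    """Cochran's C: largest group variance over the sum of all group variances."""
    variances = [sd ** 2 for sd in sds]
    return max(variances) / sum(variances)


# F-Scale standard deviations for the nine groups in Table 18.
table_18_sds = [19.27, 19.63, 20.95, 20.95, 19.65, 16.57, 18.04, 19.15, 18.12]

print(round(hartley_f_max(table_18_sds), 2))
print(round(cochran_c(table_18_sds), 2))
```

Both statistics use only the extreme variance or variances, which is one reason they are hard to interpret when group sizes differ as widely as they do here (34 to 390).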

Many of the analyses that follow, then, will be plagued by heterogeneity 
among the variances accompanied by unequal numbers of observations per group. 
Tests of heterogeneity have not been computed because the variability among 
the standard deviations is usually obvious and because, in any event, no 

TABLE 20

Analysis of Variance for R-Scale Means of American Subjects
Grouped by University and Peace Corps

University                      N       Mean      S.D.    F-Ratio*

1. Oklahoma                    118     91.44     15.74
2. Boston                      122     89.42     15.48
3. Michigan                     85     89.10     16.55
4. Utah State
   (Graduate Ss. excluded)     390     88.99     14.35
5. Washington                   34     84.71     15.83
6. Ohio State                  317     84.50     15.57
7. California                   66     82.88     13.69
8. Peace Corps                  57     81.51     14.92
9. Harvard                     108     78.10     18.83     9.42**

*  d.f. = 8/1289
** For d.f. = 8/1000, F.05 = 1.95, F.01 = 2.53

statistical program for analysis of variance with heterogeneous variances 
was available. Obviously, then, caution must be exercised in interpreting 
some of the analyses. Given the evidence of the robustness of the F-Test 
and the lack of knowledge as to the effect of departures from homogeneity 
of variances combined with unequal group size, it seems reasonable generally 
to accept the findings. In many instances, the striking differences between 
or among the means, or the obvious similarities, make the statistical tests 
almost irrelevant. 

It should also be noted that little attempt will be made to interpret 
the differing variances (standard deviations, as reported in the tables). 
Discussions of authoritarianism and dogmatism have typically been in terms 
of the central tendencies of groups. For example, it has been argued, and 
findings indicate, that conservative religious groups will have higher F-Scale 
means than liberal religious groups. Rokeach (1960) has argued and presented 
preliminary evidence that religious groups on both extremes of a conservative- 
liberal continuum will have high Dogmatism Scale means. On the other hand, theor- 
etical considerations of the relationship between, for example, religious 
faiths and variability of F or D scores have not been published. Given the 

TABLE 21

Mean Differences(a) on the F-Scale for American Subjects
Grouped by University and Peace Corps

University                      1       2       3       4       5       6       7       8       9

1. Utah State
   (Graduate Ss. excl.)              4.00*   6.41*  13.25*  13.60*  16.95*  17.82*  18.27*  27.73*
2. Boston                                    2.41    9.25*   9.59*  12.95*  13.82*  14.27*  23.72*
3. Oklahoma                                          6.84*   7.19*  10.53*  11.40*  11.86*  21.32*
4. Michigan                                                   .34    3.70    4.56    5.02   14.47*
5. Ohio State                                                        3.35    4.22    4.68   14.13*
6. California                                                                 .87    1.33   10.78*
7. Washington                                                                         .46    9.91*
8. Peace Corps                                                                               9.45*
9. Harvard

(a) Due to rounding errors, the mean differences reported in this and the
    following tables may not agree exactly with those obtained by subtracting
    the means presented in the analysis of variance tables.

*   Significant at the .05 level, Duncan's New Multiple Range Test.



lack of theoretical speculation about variability, and the unconvincing nature 
of ex post facto interpretations, the focus of reporting will be on central 
tendency rather than dispersion. 

A couple of additional points of interest in regard to the means of the 
college groups bear mentioning. First, one might expect that Peace Corps 
volunteers, by virtue of their decision to dedicate a portion of their lives 
to working in alien cultures, would be less authoritarian, dogmatic, and/or 
rigid than college students in general. The analyses bear out this expec- 
tation. Peace Corps volunteers in the U.S.U. training program did have 
lower mean F, D, and R scores than any of the university samples, with the 
exception of Harvard. Not only were the Peace Corps means second smallest 
in magnitude in each case, but that mean was significantly lower than three 
other means on the F-Scale and four on the D- and R-Scales. There also was 

TABLE 22

Mean Differences on the D-Scale for American Subjects
Grouped by University and Peace Corps

University                      1       2       3       4       5       6       7       8       9

1. Utah State
   (Graduate Ss. excl.)              4.08    4.67    6.09*  12.83*  12.84*  15.18*  17.27*  20.21*
2. Boston                                     .59    2.01    8.75*   8.76*  11.10*  13.19*  16.13*
3. Oklahoma                                          1.42    8.16*   8.17*  10.51*  12.60*  15.54*
4. Ohio State                                                6.74*   6.75*   9.09*  11.18*  14.12*
5. California                                                        .008    2.35    4.44    7.38
6. Michigan                                                                  2.34    4.43    7.37
7. Washington                                                                        2.09    5.03
8. Peace Corps                                                                               2.94
9. Harvard

* Significant at the .05 level, Duncan's New Multiple Range Test.



a significant mean difference between the Peace Corps and the pooled
university samples on each of the three scales (see Table 24). Any
temptation to attribute lower authoritarianism, dogmatism, or rigidity to
Peace Corps volunteers as compared to college students in teacher education
must be tempered by the knowledge that the Peace Corps means did not differ
significantly from some of the college means, and that the Harvard mean was
lower in all three analyses, although significantly so only with the
F-Scale.

It is also interesting to compare the item means for the F-Scale as 
reported in Table 18 with those for the various previous studies reported 
in Table 1. The highest mean in Table 18 is only slightly higher than the 
lowest one in Table 1. No tests of significance were run between the two 
sets of means, but the slight overlap in the distributions (placing the means 
in Table 1 in rank order) is striking. 

Comparisons between the two tables have some relevance for questions 
about the authoritarianism of students in teacher education as compared to 
other college students. Certainly, there is little in these tables to 
suggest that teacher trainees are more authoritarian than other college 
students, and much to suggest that they may be less so. Of course, any such 

TABLE 24

Analysis of Variance for the F-, D-, and R-Scale
Means of University and Peace Corps Subjects

F-Scale
                       N       Mean      S.D.    F-Ratio*

Peace Corps           57      78.72     19.15
University          1348      88.33     21.22    11.29**

D-Scale
                       N       Mean      S.D.    F-Ratio*

Peace Corps           57     129.44     22.22
University          1348     140.62     23.70    12.21**

R-Scale
                       N       Mean      S.D.    F-Ratio*

Peace Corps           57      81.51     14.92
University          1348      87.11     15.91     6.80**

*  d.f. = 1/1403
** For d.f. = 1/1000, F.05 = 3.85, F.01 = 6.66

TABLE 23

Mean Differences on the R-Scale for American Subjects
Grouped by University and Peace Corps

University                      1       2       3       4       5       6       7       8       9

1. Oklahoma                          2.02    2.34    2.45    6.74*   6.94*   8.56*   9.93*  13.34*
2. Boston                                    .313    .429    4.71    4.91*   6.54*   7.91*  11.32*
3. Michigan                                           .12    4.40    4.60*   6.23*   7.60*  11.00*
4. Utah State
   (Graduate Ss. excl.)                                      4.28    4.48*   6.11*   7.48*  10.89*
5. Washington                                                         .20    1.83    3.20    6.60
6. Ohio State                                                                1.63    3.00    6.40*
7. California                                                                        1.37    4.78
8. Peace Corps                                                                               3.41
9. Harvard

* Significant at the .05 level, Duncan's New Multiple Range Test.



conclusion must be cautiously drawn because comparison of the two tables 
confounds the time factor (there may have been a general decrease in F- 
Scale scores in this society from 1950 to the present) and the subcultural 
factor to be explored next in this chapter. Obviously, samples drawn from 
the same universities at the same times are needed for valid comparisons 
between education and other college students. 



Religion 

The initial impetus for a study comparing different university samples
came from a concern with the bias in central tendency that might be
introduced by using a sample heavily dominated by one religious group, in
this case Mormons. The question about the impact of religious affiliation
has remained of paramount concern, and looking at the data from that point
of view is most revealing in terms of the influences that subcultural
variations can have on personality test norms.

Table 25 reports the results of an analysis of variance comparing the 
F-Scale means of seventeen religious groupings, including those subjects 



